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Preface 



The pioneer spirit is still vigorous within this nation. Science offers a largely 
unexplored hinterland for the pioneer who has the tools. . . - (Vannevar 
Bush, Science, The Endless Frontier: A Report to the President, July 1945] 



The words of Vannevar Bush have not lost currency in the intervening 
four decades. But his 1945 report testifies to a further proposition: behind 
most scientific explorations stand committees on research, responsible for 
seeing that the tools of science are kept current, in adequate supply, and 
available to those who can use them most productively. These responsi- 
bilities call not only for short-term decisionmaking on a monthly or other 
periodic basis, but also for occasional sweeps of the horizon, to absorb the 
lessons of the past and plan thoughtfully for the future 

The Committee on Basic Research in the Behavioral and Social Sciences 
was established in early 1980 at the request of the National Science Foun- 
dation and operates under the auspices of the National Research Council's 
Commission on Behavioral and Social Sciences and Education. The com- 
mittee's first task — to assess the value, significance, and social utility of 
basic rerearch in the behavioral and social sciences — was designed to re- 
spond to questions posed to the foundation, principally by its congressional 
overseers, on a fairly short-order basis. These inquiries required a systematic 
look at the nature and methods of research in these fields and specification 
of the criteria by which a national interest in support of basic research could 
be established. This first phase of committee work resulted in the publication 
of Behavioral and Social Science Research: A National Resource (National 
Academy Press, 1982). 

Carrying out that initial task meant devoting a relatively small proportion 
of the committee's time to considering the longer-term trends of research 
advances in behavioral and social sciences, although these were reflected 
to some degree in thz 1982 report. The present volume, fruit of the second 
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phase of committee activity, is largely devoted to assessing such trends. 
Symbolizing this interest, the papers in this volume were presented first at 
a commemorative public symposium held November 29-30, 1983, marking 
the fiftieth anniversary of the publication of Recent Social Trends in the 
United States (McGraw-Hill, 1933), the landmark report of the F^esident's 
Research Committee on Social Trends. The research committee, appointed 
by Herbert Hoover in 1929 to investigate the overall condition of the nation, 
was comprised entirely of social scientists. Economist Wesley C. Mitchell 
was chair of the committee, and political scientist Charles E. Merriam was 
vice-chair. The dominant voice proved to be that of sociologist William F. 
Ogburn, the director of research. Recent Social Trends, with its 29 sepa- 
rately authored chapters, nearly 1,600 pages, and foreword by President 
Hoover, was soon labeled and has since been informally referred to as the 
Ogburn report. 

This volume is inspired by the Ogburn report in several ways. The study 
of social trends has continued to be a major research area across many of 
the behavioral and social sciences. Four chapters in this volume highlight 
advances in theories and methods devoted to understanding social, orga- 
nizational, and economic change since the Hoover era. A second theme is 
the increasing use of quantitative concepts and data in decisionmaking, 
explored in three chapters on the use of numbers in democratic political 
systems, criminal justice policy, and individual choice behavior. A final 
theme is the remarkable growth of the study of cognition and behavior, 
covered in chapters on child development, language, and visual perception. 
Each of the 10 thematic chapters is a vivid portrait of newly gained knowl- 
edge, taken from a particular perspective; as a whole, the volume is a 
selective sampling from the gallery of behavioral and social science ac- 
complishments of the past 50 years. 

The idea that our committee might take 'the Ogburn report as a reference 
point for this phase of its work was first suggested by Otto N. Larsen, 
senior associate for social and behavioral sciences at the National Science 
Foundation. It is a pleasure to acknowledge his role and that of the foun- 
dation generally in providing a continuing and substantial commitment of 
intellectual and material support to the committee; we particularly wish to 
acknowledge the contributions of Eloise E. Clark, formerly assistant director 
for biological, behavioral, and social sciences; James H. Blackman, for- 
merly acting director of the Division of Social and Economic Science; and 
Richard T. Louttit, director of the Division of Behavioral and Neural Sci- 
ences. 

We are indebted to the staff of the National Research Council for ren- 
dering many services during preparations for the symposium and this report. 
In particular, David A. Goslin, executive director of the Commission on 
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Herbert Hoover, in his preface to the report of the President's Research 
Committee on Social Trends (1933), explained that he had asked a "group 
of eminent scientists to examine into the feasibility of a national survey of 
social trends ... to undertake the researches and make ... a complete, 
impartial examination of the facts." Hoover noted that the committee's 
report on the findings compiled by their many experts "should serve to 
help all of us to see where social stresses are occurring and where major 
fforts should be undertaken to deal with them constructively." 1 The focus 
of this distinguished committee of social scientists (the term behavioral 
science had not yet gained currency) and the hundreds of consultants who 
contributed to the report was to document the state of the nation, especially 
in terms of changing institutions, and to make such recommendations as 
seemed appropriate for public policy or private action. The most notable 
aspect of the 1,600-page report was its unified view (President's Research 
Committee on Social Trends, 1933, pp. xii-xiii): 



The members of the committee were Wesley C. Mitchell, chair, Charles E. Merriam, vice- 
chair, Shelby M. Harrison, secretary-treasurer, Alice Hamilton, Howard W. Odum, and William 
F. Ogburn. The executive staff included Ogburn as director of research, Odum as assistant director 
of research, and Edward Eyre Hunt as executive secretary. Although President Hoover initiated 
and appointed the research committee, funding for its investigations was provided by the Rockefeller 
Foundation. Substantial services and personnel were provided by the Social Science Research 
Council and the Encyclopedia of the Social Sciences. The list of acknowledgments of other 
institutions and individuals assisting in the work ran to 12 pages. For accounts of the complex 
dynamics of the committee, see Karl (1969, 1974). 
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It may indeed be said that the primary value of the report is to be found in the effort 
to interrelate the disjointed factors and elements in the social life of America, in the 
attempt to view the situation as a whole ... as a national union the parts of which too 
often are isolated, not only in scientific studies but in everyday affairs. ... It is the 
express purpose of this review of findings to unite such problems as those of economics, 
government, religion, education, in a comprehensive study of social movements and 
tendencies, to direct attention to the importance of balance among the factors of change. 

That attempt to bring the entire range of social science and what we now 
call behavioral science to bear on a comprehensive array of national issues 
in the United States was unprecedented and, in fact, remains unique. 2 It is 
difficult even to imagine a comparable effort being undertaken today. This 
is not for lack of individuals with the intellectual range and authority of 
Ogburn, whose unifying view the report largely reflects and with whom it 
is most often identified. Rather, the theoretical and philosophical presup- 
positions that could undergird a comprehensive mobilization of scientific 
knowledge in the interest of national planning and reform — presuppositions 
shared in important respects even by the one-time radical activist Ogbum 
and the conservative engineer Hoover — no longer hold sway. The sheer 
size of the research base and the scope of government action have broadened 
immensely, while the disciplines and government bureaus have fissioned 
into a multitude of specialties, whose skepticism about the value of any 
unified effort would be an enormous barrier even were there a will to try 
it. 

This volume therefore does not try to develop and unify more recent 
research findings and make recommendations concerning national trends, 
j Our aim is to spotlight a number of important changes within behavioral 
and social science research itself. Our procedure is not, rtrictly speaking, 
a historical one; the following chapters do not constitute formal histories 
of science, by which one means the careful tracking through time of events, 
ideas, institutions, and persons as these interact to produce continuities and 
changes from one scientific era to another. Rather, our intention is to select 
certain discoveries and advances that have occurred over the last half- 



2 A series of studies carried out by federal mandate in the mid- and late 1960s involved some 
tasks similar to those of the research committee, but no single study had nearly so broad a mandate. 
These efforts included the Advisory Committee on Government Programs in the Behavioral Sci- 
ences (1968); the Behavioral and Social Sciences Survey Committee (1969); and the Special 
Commission on the Social Sciences of the National Science Board (1969) There was also strong 
behavioral and social science representation during this period in the work of special-purpose 
national commissions on such subjects as pornography, law enforcement and criminal justice, and 
marijuana and drug abuse. 
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century and to show in what ways they clearly distinguish the present from 
the past. 

TTie Ogburn report did not address itself primarily to the state of the art 
in fields whose practitioners were involved in its preparation. Yet it provides 
a unique window on certain major contours of thinking in certain fields at 
that time. The authors of the following chapters have drawn portraits of S 
current research on major topics and contrasted these with earlier periods, 
particularly the era of Hoover's presidency. The subjects range from theories 
of large-scale social change to shifts in understanding the visual process; 
within this span fall such topics as economic modeling, ability testing, 
criminology, children's learning, and phonology. All these research fields 
were active a balf-century ago, but in every case the science has changed 
markedly. The changes can be summarized as advances in methodology 
and advances in theory. 

An increasingly extensive, precise array of methods is now used in 
behavioral and social science research. These methods of gathering, or- 
ganizing, and querying data cut much closer than before to the core of 
individual and collective human behavior, enabling researchers and others 
who use the methods to look into ranges of phenomena not hitheno ac- 
cessible to direct observation, analysis, or experiment. Examples of these 
methodological advances are numerous. Current, detailed, accurate em- 
ployment/unemployment numbers simply did not exist at the time of the 
Ogburn report — the work force was counted only by the decennial census, 
and then only in terms of "usual occupations." The best estimates of the 
distribution of income in the United States available to Ogburn's research 
committee in 1930-1931 were based on special data collected by the Na- 
tional Bureau of Economic Research in 1918. Similarly, the Ogburn report's 
chapter on the changing opinions and attitudes of the public is based entirely 
on assessments of articles in leading magazines, books, and newspapers; 
the direct scaling and sample surveys of people's attitudes and opinions 
had not yet been invented. Indeed, methods for generating most of the 
frequently updated indicator series taken for granted by modem researchers, 
public officials, corporate decisionmakers, and evening news watchers did 
not begin to appear until the 1930s. Exact statistical and quasi-experimental 
research on penal deterrence, the preventive relationship between punish- 
ment and crime, did not begin until the 1960s. In the study of mind and 
behavior, the microelectrode, optical devices such as the Ames window, 
the sound spectrograph (invented at Bell Laboratories during World War 
II), and computers, including new mathematical software for efficient so- 
lution of large-scale statistical equations, radically changed the character 
of research undertakings. 

In parallel with but independent of these methodological advances, the- 
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ones in behavioral and social science have become far mr tuned to the 
complexity, subtlety, and persistence of variable, subj; u ■ phenomena 
such as ideas, values, emotions, and images. The classical traditions of 
Western thought that dominated behavioral and social theory earlier in the 
century insisted either that subjective phenomena were immediate reflec- 
tions of material reality, simply summarizing objective experience, or that 
subjective phenomena formed a separate and mysterious realm, inaccessible 
to measurement or rigorous analysis. In contrast, many current theories and 
empirical inquiries guided by them involve an increasingly detailed picture 
of the origins, character, and relations between people's internal represen- 
tations, values, and attachments, and their behavior toward objects, insti- 
tutions, and persons. The theoretical work of Keynes on macroeconomics, 
Chomsky on language generation, Simon on decisionmaking, and Deming 
on statistical quality control emphasizes the importance of human agency 
in effecting performances and outcomes. 

These advances have not occurred without friction. In any field, new 
approaches are connected to earlier disputes and are always controversial. 
Theoretical arguments are seldom concluded by the progress of research; 
instead the debate shifts over time to different and more sophisticated 
grounds. Theories are more often improved than disproved. 

The themes of increasing methodological precision and theoretical so- 
phistication weave through each chapter of the report. The 10 chapters are 
ordered under 3 headings: Understanding Social Change, Numbers and 
Decisionmaking, and Discovering the Mind at Work. While any division 
is to some extent arbitrary, these headings are meant to emphasize some 
of the major lines of advance in the last half-century. 

Social change was, of course, the main focus of the Ogburn report. 
Ogburn's own studies of technological innovation and its consequences 
were highly influential in their day and continue to underlie important 
segments of contemporary popular thought, although much of his perspec- 
tive has since been modified by investigators seeking to understand social 
changes for which Ogburn's theories did not account. 

The role of numbers in decisionmaking, particularly in the ever-changing 
landscape of American markets and political institutions, was a second 
overriding theme of the Ogburn report. This theme is taken up in this volume 
in several contexts: the role played by statistical agencies and information 
in democratic politics, the importance of probabilistic perceptions in me- 
diating the deterrent effects o f punishment on crime, and the distinctive 
calculi of values and probabilities that shape individual decisionmaking. In 
each instance, the authors are as much concerned with the way that long- 
term advances in knowledge interact with decisionmaking processes as they 
are with particular applications of knowledge to decisions. 
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The final section on the mind at work covers a range of discoveries in 
subjects that were not nearly as prominent 50 years ago and received little 
attention in the Ogburn report but have become centrally important in the 
behavioral sciences: individual development, conceptual and linguistic per- 
formance, and perception. The theoretical debates between behaviorist ver- 
sus cognitive or information-processing approaches have been an important 
motor of progress in each of these areas. 

UNDERSTANDING SOCIAL CHANGE 

In the opening chapter, Neil J. Smelser compares assumptions of the 
Ogburn report about the relation between social science and society with 
present-day assumptions. Even as the methods of behavioral and social 
science research have become more sophisticated and precise between 1933 
and 1983, its aspirations to social influence and power have become less 
grand. What resolves this seeming paradox is the shift from a social en- 
gineering view, which posited a direct link between learning facts and taking 
action, to a view that recognizes the necessarily "uncertain connection" 
between knowledge and policy (Lynn, 1978). 

In the social engineering view, objective facts ultimately govern social 
action, whereas ^searchers now see factual knowledge as only one com- 
ponent in a complicated set of determining processes. Rather than taking 
facts as eternal truths residing in the world waiting to be observed, facts 
are now understood as compelling interpretive statements reached by com* 
paring the results of more or less precise measurements undertaken within 
a theoretical scheme. While Ogburn thought the practice of social science 
was essentially a matter of patiently, methodically collecting enough sta- 
tistical data to be certain of the situation, rather than jumping to conclusions 
based on irrational wishes or prejudices, researchers now see the continuing 
need to develop, test, and incrementally improve the precision and inter- 
relation of research methods, measurements, and theoretical systems. 

Ogburn and many of his colleagues held that once the facts were finally, 
clearly known, one would not have to worry independently about the will 
to act on them, since well-observed facts would not admit of conflicting 
interpretations and would convince people to abandon irrational prejudices 
or fantasies. After several decades of increasingly detailed work on the uses 
of scientific knowledge, this view is now known to be oversimple. Many 
factors intervene between the scientific pursuit of knowledge and the social 
pursuit of life, liberty, and happiness: competition for power between dif- 
ferent social groups, conflict over values, and barriers imposed by the 
relative autonomy of different social spheres. Conflicts over policy derive 
from fundamental cultural values and differences in social position as well 
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as more evanescent ignorance or error. Collective action is seen as a problem 
of resource mobilization and leadership, hardly an automatic response to 
scientific evidence. In short, as the social and behavioral science research 
base has become much stronger, it is al^ much more clearly understood 
why policy and politics can never rat on scientific research alone. 

In the next chapter, Albert J. Reiss, Jr. , examines a reciprocal relationship 
that lay directly at the core of Ogburn's interests, the relationship between 
social science innovations and broader social changes . Ogburn was a pioneer 
in formulating the theory that the lead elements in social change are material 
or mechanical inventions such as the steam engine, radio, and elevator 
(without which there would be no skyscrapers), while cultural inventions 
are largely reactive, tending simply to permit social institutions to adjust 
to new material circumstances. Reiss notes that behavioral and social science 
■esearch has led to many technical inventions that have affected and changed 
society. He cites the examples of human testing, sample surveys, quality 
control methods, and cohort analyses. While perhaps not as dramatic as 
the technological impact of the automobile or the transistor, these inventions 
have profoundly affected modern life. 

Ogburn and most of his contemporaries thought that social science was 
an essentially neutral activity that evolved on its own; they did not know 
how thoroughly even such basic scientific matters as the measurement of 
population grew out of social needs and later were adapted to scientific 
ones. Social change can greatly affect the measures and concepts of social 
science, which are in turn increasingly important in shaping the understand- 
ing of and response to change. For example, the massive levels of job- 
lessness experienced during the Great Depression substantially changed the 
way in which the work force was measured. Decennial surveys of workers' 
"usual occupation" were supplanted by monthly surveys of current em- 
ployment status. In turn, these measures were vital to managing the wartime 
economy and subsequently to local, state, national, and corporate planning 
and analysis. 

Reiss concludes that current studies of social change could be improved 
by attending more to organizational and other collective variables in contrast 
to the prevalent bias toward measures of individual behaviors, and by 
reorienting various aspects of the national statistical system. Such reorien- 
tation might not only provide better indications about domestic social trends 
but also aid in comparisons between the United States and other advanced 
industrial societies. 

Carrying this last theme several steps further, Michael T. Hannan takes 
up questions of organizational change, delineating certain recent innovations 
in organizational research. His central concern is with issues of inertia 
versus change and homogeneity versus diversity: how populations of or- 
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ganizations respond to shifting or uncertain environments. In Ogburn's era 
and most of the years since, the dominant lines of organizational analysis have 
been based cm the study of executive decisionmaking and its consequences. 
Theories of rational adaptation proposed that organizational leaders could see 
changes arising in the environment and make more or less sensible plans to 
adjust to than, presupposing that organizations comply with their leaders 9 
intentions. Theories of random transformation proposed instead that organi- 
zational change is loosely coupled with environmental changes, because or- 
ganizations are rife with internal politics, which makes compliance with leaders 9 
intentions an uncertain matter, and because planning in uncertain environments 
is a highly precarious, often hit-or-miss business. Hannan outlines a new 
approach that treats populations of organizations in an evolutionary and eco- 
logical perspective. This type of research examines die scale and frequency 
of changes in socioeconomic conditions, how these changes affect die fortunes 
of generalist versus specialist organizations, which conditions force organi- 
zations to conform to a standard model, and which encourage diversity of 
forms. This approach takes die organizational species as the unit and asks how 
well different species survive specifiable changes in competitive or other en- 
vironmental conditions. 

Hannan points out that Ogburn considered social organizations highly 
inertial, resistant to change in their accustomed routines and motions. The 
Ogburnian prescription to overcome this inertia — application of pressure 
from above in the form of planning based on superior statistical systems — 
strikes present-day students of organization (in the United States, at least) 
as unlikely. Organizational inertia is too strong and experienced managers 
are too clever at finding ways to absorb such pressure without making 
fundamental changes. Hannan concludes that more research needs to be 
done on sources of organizational diversity and creation, since there is 
substantial reason to think that in uncertain environments, new or atypical 
organizations will be more successful in meeting the demands of the sit- 
uation than older, standardized ones. Rather than searching for sources of 
transformation of organizations, analysis of change would be based on 
examining whole populations of organizations to determine their rates of 
birth and death and degree of heterogeneity. In this respect Hannan is at 
one with Reiss's prescription, that more studies should be conducted on 
organizations rather than on individuals. 

Lawrence R. Klein reviews the growth of macroeconomic models and 
forecasts, which apply some of the most highly regarded and dramatic 
advances in social theory and measurement to near-term socioeconomic 
change. Klein traces the beginning of macroeconomic model-building from 
the 1930s. Macroeconomic models as we know them now, involving hundreds 
of aggregate equations and frequently updated series of economic indicators, 
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simply did not exist then. Analyses of the business cycle, apart from isolated 
pioneering attempts at modeling, were based on very general principles and 
on trends of isolated economic variables, rather than on attempts to relate 
these series to each other. Neither today's detailed statistics nor a usable 
theory was available to try to predict such things as the level of employment 
or interest rates. The relationship between these items and such extant series 
as commodity price indexes, and aggregate product measures such as gross 
national product, were not even guessed at. 

The chapter on economic organization in the Ogburn report, by Edwin 
F. Gay and Leo Wolman, attempted to locate the causes of the Great 
Depression in a combination of cyclical and noncyclical factors: the ex- 
traordinary government debt that arose during World War I, which the 
federal government devoted much of the 1920s to retiring (actually reducing 
that indebtedness by about 40 percent); the shift in consumer purchasing 
patterns from pe.ishables to durables, whose replacement could easily be 
postponed, making consumer markets far more volatile; excessive business 
investment in mergers, the creation of holding companies, and other fi- 
nancial combinations; poor banking practices, particularly the willingness 
to devote ever-increasing credit resources to loans on real estate and in- 
dustrial securities (these, in turn, being subject to episodes of speculative 
frenzy) and to extension of consumer credit; an overall depression of ag- 
ricultural prices; and an "unsound international commercial policy" based 
ultimately on the need of defeated Germany to finance enormous war re- 
parations. What is missing from this perspective, for moderns used to 
hearing economic analysts tie up the stock market, foreign affairs, interest 
rates, and shifts in employment in a single paragraph, is any sense of how 
these items interact. 

Keynes's general theory suggested in 1936 a relatively compact way to 
express in a small number of equations the relations between large aggre- 
gates such as the overall supply of money, the gross national product, total 
investment, the average interest rate, and overall employment. National 
and international economic indicator series, which became available in 
increasing numbers shortly before, during, and after World War II, provided 
increasingly informative statistics on which to fit these models. The strategy 
of macroeconomic model-building was perfected in principle after World 
War II, but it became clear that more accurate forecasts required more 
detailed systems of equations. These could be constructed in a preliminary 
way with the statistics then available, but there were severe computational 
limits, which were resolved only after high-speed computer capabilities 
(hardware) and appropriate new mathematical algorithms (software) com- 
bined after the mid-1960s to enable the rapid solution of hundred-equation 
and even several-thousand-equation models. 
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Economists have used mathematical models to discard crude versions of 
a number of macroeconomic theories and to develop more sophisticated 
ones. But the models do not yet permit unambiguous choices between the 
more sophisticated versions of several competing theories about the basic 
workings of the macroeconomy . The typical macroeconomic model fits the 
observed data on which its specific numerical coefficients are estimated, 
but when the fitted model is then applied to generate predictions in other 
cases, it works much less precisely, being satisfactory in some instances 
but not others. 

An obvious aim for users of macroeconomic models is to employ the 
models to control economies the way engineering controls keep physical 
systems on an even keel. This has proven very difficult. Looking to the 
future, Klein notes that, while pure statistical andysis of economic time 
scries currently competes with macro models, it would be useful to find a 
way to combine them and to incorporate many mere social, political, and 
demographic variables in economic analysis. This is the kind of unifying 
recommendation that Ogburn might have applauded. But today the emphasis 
is on the testing and refinement of theories as the primary use for such 
elaborate constructions of social data; applications such as planning would 
be thought appropriate only well down the road. 

NUMBERS AND DECISIONMAKING 

Kenneth Prewitt considers the growth and complex impact on American 
democratic politics of many of the public statistical systems discussed in 
the previous chapters. Noting the close linkage of these statistical systems 
to the research interests and products of behavioral and social science, 
Prewitt focuses on the role of statistical enterprises in such intensely practical 
problems as electoral accountability, political agenda-setting, and public 
resource allocation. Numbers or, more exactly, statistical systems that count 
various aspects of social action and provide numerical indicators of what 
is occurring in society play an essential role in at least three underpinnings 
of successfully democratic states: as vehicles for assessing the performance 
of government policies and programs; as ways of setting agendas by iden- 
tifying or documenting particular interests; and as instruments for allocating 
government resources, for example, by statistical definitions of rights or 
entitlements, as in the allocation of federal funds according to ' 'percentages 
of people living below the poverty line" in a congressional district. Prewitt 
indicates that social scientists who develop statistical methods and data- 
gathering surveys essentially for research purposes are also by virtue of this 
professional expertise the "keepers of the number system," responsible for 
seeing that the best kind of counting is done. He adds that this role entails 
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a responsibility to educate the public, including officials, about what the 
numbers mean — their strengths as well as their limits. 

If we compare the concerns documented by Prewitt with the Ogburn 
report, and particularly the concluding chapter on government and society 
by Charles E. Merriam, we are struck at once by the new significance of 
number systems in mediating political accountability, representativeness, 
and framing of the political agenda. Merriam clearly notes these problems 
and suggests that scientific investigations of human behavior may lave 
broad political significance in the future; he also stresses the enormity of 
the problems facing government then due to the economic transformations 
and crises of the period. Merriam did not, however, share Ogburn's en- 
thusiasm for statistics as a possible solution to social conflict, a basis of 
coordination and planning that might harmonize diverse interests. Prewitt's 
chapter in important respects combines the legacies of Ogburn's and Mer- 
riam' s conflicting views. Prewitt confirms Ogburn's sense of the potential 
power of number systems but couples it with Merriam's sense that the larger 
question is how these and other instruments of governance would be put 
to use in regulating new relations being formed among the government, the 
electorate, and large economic organizations. 

Focusing on a quite specific issue of social policy, H. Laurence Ross 
and Gary D. LaFree review recent studies on the power and limits of induced 
change in formal criminal justice operations to deter street crime and drunk 
driving. They empfc .size how the public perception versus the organiza- 
tional actuality of criminal sanctions can effect the results of changes in 
the law. Before 1960, virtually no empirical, quantitative evidence existed 
on the effectiveness of increasing levels of deterrent threat as a method for 
reducing rates of street crimes or drunk driving. Criminology in the earlier 
period did not analyze the effects of punishment in its various real stages 
of implementation (e.g., rates of police patrolling, apprehension, convic- 
tion, sentencing, etc.) on the prevalence of crime. The chapter on crime 
and punishment in the Ogburn report, by Edwin H. Sutherland and C. E. 
Gehlke, presented statistics on the increased severity of the penalties per- 
mitted by law and the increased sizes of police forces. But their principal 
emphasis was to document that no "crime wave" was evident in the period 
1900-1930, that rates of offending were fairly level except for the new 
crimes of automobile traffic offenses and liquor distribution. Questions of 
rehabilitation were the main ones identified for future research. 

Ross and LaFree believe that the practicable research agenda on reha- 
bilitation has largely been exhausted, with fairly negative results. They 
document a series of recent studies on deterring crime that led to the 
following results. Increasing the perceived certainty of apprehension for 
criminal behavior — by funding more police foot patrols or well-publicized 
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anti-drunk-driving patrol measures — does cut the rate of offending, al- 
though at least in the case of drunk driving the desired effect seems to be 
short-lived. There is serious question whether statutory provisions providing 
for increased severity of sentences for offenders can alone have any effect. 
Drawing their policy analysis to a close, Ross and LaFree conclude that 
manipulation of sanctions appears to be of little independent value, while 
increased police activity is expensive to achieve. They recommend explo- 
ration of alternatives that reduce the damage to victims of street crimes or 
drunk driving, e.g., measures such as victim compensation or more crash- 
worthy vehicles and roads. Other alternatives, not discussed by Ross and 
LaFree, include neighborhood volunteer patrols and efforts to change public 
attitudes and policies on server behavior that can inhibit drink driving. 

Ross and LaFree emphasize that individual perceptions of risk in practical 
situations czn determine in part how policy intentions are translated into 
attitudes and behaviors. Daniel Kahneman and Amos Tversky investigate 
the ways in which individual decisions are influenced by persistent attitudes 
on risk-taking and the value of gains versus losses, as well as by variable 
ways to construct mental accounts of personal behavior, such as expenditure 
decisions. Kahneman and Tversky see a smooth relationship between the 
rationalist principles of decisionmaking formulated in the eighteenth century 
by Bernoulli and the prescriptive theories of rational choice propounded by 
von Neumann and Morgenstern in 1947. The notion that one could make 
decisions by rational, logical, robust quantitative analysis is an appropriate 
behavioral complement to Ogburn's emphasis on statistical systems and 
planning. Even Robert Lynd's iconoclastic chapter in the Ogburn report on 
consumer behavior seeks solutions to the ambiguities of market choice in 
the development of informational consumer advisory groups. But the co- 
nundrums that have come to dominate behavioral analysis of decisionmak- 
ing in recent years — the * 'prisoner's dilemma," Arrow's "impossibility 
theorem," behavioral experiments contradicting von Neumann and Mor- 
genstern's principles — depart dramatically from prescriptive rationalist psy- 
chology. 

Kahneman, Tversky, and others are developing an empirical understand- 
ing of individual choice behavior that involves measurable quantities such 
as dollars or numbers of deaths. These choices are conceived to have two 
levels. At one level, that of analyzing risky choices, individuals faced with 
a decision, seen for simplicity as a series of binary options, must make two 
kinds of subjective computations or estimates regarding the possible out- 
comes of the decision. One set of estimates concerns the probability that 
a given choice at present will lead to one or another future outcome; the 
other set of estimates concerns how desirable each outcome seems at present. 
The desirabilities of the possible outcomes weighted by their prot NUties 
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of occurrence should govern the decision. But persons studied by Kahne- 
man, Tversky, and other psychologists tend to have two kinds of systematic 
biases. First, they tend to overweight low-probability or high-probability 
outcomes and to underweight moderate probabilities. Second, they tend to 
be loss-averse: more negative about losing a certain amount of money than 
they are positive about gaining the same amount. The net result is that, 
when faced with making choices involving risk, people usually prefer to 
take a sure gain rather than to gamble for a greater gain (versus none). But 
with similar amounts at stake, they would rather pass up a sure loss in 
order to gamble on a greater loss (versus none). 

One also has to consider a second level of decisionmaking called mental 
accounting. There is more than one way to frame a choice in terms of 
relative gains versus losses — this largely has to do with what one chooses 
to think of as the zero point. The way that a choice is presented, the frame 
built around the choice, may influence the decision. In other words, decision 
weights may not be robust. People do not necessarily make the same choice 
when faced with the same objective options framed in different ways, 
especially if the different frames take advantage of the biases that are built 
into people's ways of computing desirability and probability. For example: 
it is more attractive to frame property or medical insurance premiums as 
the cost of avoiding highly improbable but very large losses than to frame 
them as ?. sure loss taken in preference to gambling against a range of 
smaller to larger, mostly improbable losses. Sellers of insurance do better 
appealing to people's aversion to catastrophe than indicating how sums paid 
as premiums balance against the costs and probabilities of ordinary illnesses 
or accidents, l«e psychophysics of chance and value cause people to 
over value what they already have compared with what they would pay 
to obtain the same possessions or chances anew and to engage in anoma'ous 
spending behavior depending on how, in their own minds, they think about 
each expenditure: as a direct trade-off of one purchase for another; as the 
cuTent cost of the item relative to a possibly higher or lower cost at another 
place or time; or as a net reduction in their overall assets. 

DISCOVERING THE MIND AT WORK 

The research covered by Kahneman and Tversky reveals an important 
analytical linchpin in theories on how individual choices are composed into 
social, political, and economic trends: the assumption of rationality as a 
characteristic of the sovereign consumer, autonomous citizen, or competent 
manager or worker. This assumption has turned into an increasingly com- 
plex field of study in itself. The final triplet of essays in this volume looks 
directly into the processes that constitute individual thought and complex 
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symbolic behavior, covering research on such tasks as calculation, visual 
interpretation, communication, and problem solving. 

In contrast to the major significance of studies of perceptual and cognitive 
processes today, these research areas were little attended to in constructing 
the Ogburn report. It is instructive to read the chapter on education in the 
Ogburn report, by Charles H. Judd. Judd noted that schools were largely 
replacing earlier economic employment in industry or on the farm as a 
locus of children's activities outside the family. His report urged more 
scientific study of education — but nearly all the attention to research stressed 
the move to less formal teaching methods in lieu of recitation and rote and 
the use of psychological tests to assess the state of learning of the individual 
student. Other chapters on the family, youth, and childhood paid little 
attention to cognitive matters, concentrating instead on personality and child 
welfare. 

Rochel Gelman and Ann L. Brown discuss the revisions in theory and 
method that have occurred in recent years in research on numerical, spatial, 
linguistic, and conceptual capabilities of children from infancy through 
school age, including the pedagogical processes by which in-school and 
out-of-school learning takes place — or bogs down. They place learning in 
the context of interaction between the growing child and the environment — 
initially the physical and family environment, later the school. Studies of 
infants and preschoolers show that innate cognitive faculties are far more 
sharply developed at early ages than is apparent from the limited physical 
capacities that infants have, and that "child's play" is more sophisticated 
in its use of cognitive skills than was thought. Recent studies indicate that 
infants have rudimentary computational abilities, appreciation of the mul- 
tivalent character of objects, and a strong interest in learning about the 
world, and that preschoolers are "tireless explorers" and theorists who 
generally place high values on learning, planning, thinking, and construc- 
tion of mental and physical competences. 

Gelman and Brown look at a full range of cognitive matters, including 
the relations between quantitative reasoning, linguistic concepts, and visual 
perception. Advances in knowledge have resulted from a combination of 
methodological improvements (some using new technical devices), deter- 
mination to study aspects of infant and child behavior with far more attention 
to detail than previously, and withdrawal from earlier theoietical presump- 
tions that the infant's mind must be a blank slate. Modern theory proceeds 
from the idea that complex mental constructs do not arise fr .1 simply 
associative learning or prewiring in the brain, but rather from a series of 
active search-and-learn processes that evolve along with sets of subject^ 
inferential principles. 

A major problem for schools is to retain the natural curiosity and theory- 
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building capabilities of the child and turn these to the mastery of more 
explicit, formal bodies of knov/ledge through appropriate teaching strate- 
gies. Many of the standard pedagogical practices in modern classrooms — 
asking questions to which the teacher already knows the answer, insisting 
on appropriate "turn-bidding" behavior, teaching facts through nonnar- 
rative rote and without contexts for use beyond quizzes designed to measure 
individual performance — differ substantially from teaching sequences the 
child may have experienced prior to and outside school, such as appren- 
ticeship, free play, story or song learning, and role exchange. Gelman and 
Brown point out that to broaden in-school teaching methods, the character 
of out-of-school learning situations should be recognized and better ex- 
ploited, possibly decreasing the number of children who become failure- 
oriented (liable to develop defensive behavior and * 'dumb" self-concepts 
that weigh heavily against success in school) rather than mastery-oriented 
(able to be constructively self-critical and to learn from rather than be afraid 
of making mistakes in the course of mastering new material). 

Michael Studdert-Kennedy analyzes current understanding of the manner 
in which humans encode and decode words, phrases, and meaningful com- 
munications from the highly complex and variable tones of speech and 
motions of sign languages, and he reviews the evidence that linguistic 
competence, the ability to make these reversible codifications between ideas 
and expressions, is a distinct "module" in the brain. He describes the 
emergence of a new kind of research on language centering around the 
theoretical revolution introduced in the 1950s by Noam Chomsky. The 
Drincipal result of that revolution has been to look for the faculty of language 
deeper within the human mind than had occurred under the behavioristic 
interpretation of language as something impressed on the mind as though 
on a blank slate, or within the descriptive tradition, dominant at the time 
of the Ogbum report, which was devoted to characterizing the major lan- 
guage croups, their evolution, and the seemingly endless variety of dialects. 

The current two-level notion of language sees it as a merged product of 
a phonological lexicon, or cross-registry of syllabic sounds and their root 
meanings, and a syntactic generator, which produces as well as decodes 
grammatical sentences. Both levels involve repeated sampling of a finite 
set of rules and devices to produce an infinity of possible utterances (mean- 
ingful sound sequences). The failures in applied linguistic research after 
World War II to produce machines that could translate texts automatically 
from one language to another, read to the blind, or convert speech into 
written text, were highly instructive in progress toward current conceptions 
of language. Studdert-Kennedy reveals how the sound spectrograph per- 
mitted discovery of the complex aural interlayering of syllables in actual 
speech, and how studies of aphasias led researchers to the idea that language 




INTRODUCTION 



of earlier ideas and researches, yet the breakthrough was at once a scientific 
and a practical achievement that went well beyond its original intentions. 

Because they dee embedded in social and technological change, subject 
to the unpredictable incidence of scientific ingenuity and driven by the 
competition of differing theoretical ideas, the achievements of behavioral 
and social science research are not rigidly predictable as to when they will 
occur, how they will appear, or what they might lead to. The chapters of 
this volume show that much has been learned in 50 years and that benefits 
have flowed from this new knowledge. There is in this knowledge a counsel 
of patience and challenge: the study of behavior and social life may be 
slowed or quickened, but it cannot, as Ogburn believed it could, be guided 
down orderly avenues of social equilibration or reform. One can expect the 
overall sphere of knowledge to expand; the area within it, of subjects well 
understood, to increase. But the expanding perimeter of subjects only par- 
tially understood is ever volatile with new kinds of data, new twists on 
older controversies, new ideas to be reckoned with. And beyond the realm 
of the known and the disputed lie far larger territories, unexplored and 
barely imagined. 

Behavioral and social science remains an endless frontier. 
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The occasion for the symposium on which this volume is based was to 
note trends in knowledge in the behavioral and social sciences since the 
publication in 1933 of Recent Social Trends in the United States. That 
massive book was the report of a special committee of social scientists 
commissioned in 1929 by President Herbert Hoover to conduct a survey 
on the subject. It was a monumental undertaking, the last in a series of 
efforts of the Hoover administration to augment the knowledge base for 
social policy. My assignment is to try to capture the main vision of the 
report and to indicate the ways in which that vision has changed in the half- 
century since its publication. 

President Hoover's own account of the reasons for deciding to launch 
the commission is terse. He spoke of the requests of 1 'a number of interested 
agencies" (Myers, 1934:193), and he said that "the country [in 1929] was 
in need of more action in tfte social field." He added, however, that "our 
first need was a competent survey of the facts in the social field." Then, 
upon its completion he described it as "the first thorough statement of social 
facts ever presented as a guide to public policy," adding, however, that 
4 'the loss of the election prevented me, as President, from offering a program 
of practical action based upon the facts" (Hoover, 1952:312). 

Hoover's account reveals his engineering view of social life: first the 
facts, then application based upon the facts. Later I will show how closely 
this mentality corresponded to that of the Ogburn committee itself. 
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READING THE OGBURN COMMITTEE REPORT TODAY 

As indicated, my main task is to interpret the broad vision of the Ogbum 
committee report 1 and the subsequent vicissitudes of that vision. I should 
like to begin, however, by reporting a few reflections that occurred to me 
while plowing through the 39 chapters and 1 ,568 pages of the report. 

First, some things apparently never change. In a chapter on "Recreation 
and Leisure Time Activities," J. F. Steiner (President's Research Com- 
mittee on Social Trends, 1933:931) assured the reader that 

football can hardly be regarded as a passing fad which will soon give way to something 
else. The huge investments in stadia, which must be paid off in future years, make 
almost inevitable the continual approval ot the game by college administrative author- 
ities. Its capacity to generate gate receipts and its value as an advertising medium art 
assets that cannot be ignored. 

In his chapter on "Education, " Charles H. Judd quoted with approval 
Henry Pritchett' s condemnation of the consequences of competition in sports 
(p. 377): 

Every college or universit*' longs for a winning team. . . . The coach is on the alert to 
bring the most promising athletes ... to his college team. A system of recruiting and 
subsidizing has grown up. . . . The system is demoralizing and corrupt ... the strict 
organization and the tendency to commercialize the sport have taken the joy out of the 
game. 

Second, and in like spirit, there were many other statements that also 
might have been written today, even though we know how much things 
have changed in 50 years. In one of the chapters, entitled "The Activities 
of Women Outside the Home," S. P. Breckinridge concluded that "wom- 
en's role in the American community has undergone redefinition during the 
past thirty years" (p. 709). She mentioned industrial advances, the rise of 
specialized services, and the decreased size of the family as having elim- 
inated many of women's household activities. As a result, she noted that 
"large numbers of women through necessity or choice are seeking a new 
place in the economic system." Moreover, 

the shift is not being made without revolutionary changes in attitudes with regard to 
women's responsibilities under the changed surroundings of their lives. Their new 
position ... is giving women a share in the entire life of the community. 

Third, and with the aid of historical hindsight, the reader cannot fail to 



■The report was identified with the name of Ogbum even at the time of its publication (Dufros, 
1933). 
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notice some obviously slighted topics. The committee acknowledged that 
the Great Depression of the time "is not explained/' though apprehensive 
mention of its ravages appears from time to time. A generous interpretation 
of this is that the Great Depression struck only a few months before the 
committee was formed, and that the committee was as confused as the rest 
of the nation by the tragedy. Also, many ideas (Keynes's theory of un- 
employment) and measuring techniques (national economic accounts), help- 
fid in understanding depressions, were not yet invented. In addition, however, 
the Depression was the largest political issue of the day, and Ogburn was 
insistent on presenting facts neutrally and avoiding politically sensitive 
issues, whether by temperament or out of deference to the President. 2 

The same reason might account for the virtual absence of materials on 
race and ethnic relations — though one chapter dealt with racial conditions — 
which seems surprising in light of the presence on the committee of Howard 
Odum, the day's leading sociologist of the South. It is inconceivable that 
such a report could be written today without major attention devoted to 
racial and ethnic issues. In addition to the possibility that race and other 
controversial areas were soft-pedaled, it should be remembered that race 
relations were then still largely regional rather than national, that the political 
mobilization of blacks was in its infancy, and that neithu politicians nor 
social scientists had begun seriously to challenge the racist foundations of 
American social life — all of which would contribute to the low visibility 
of racial problems. 

THE OGBURN VISION OF SOCIAL PROCESS 

One reviewer of Recent Social Trends remarked that "the Committee 
findings are so unified and eloquent as to give the impression of single 
authorship" (Mallery, 1933:211). That authorship was largely Ogbum's. 
It is remarkable to observe the degree to which he dominated the committee 
report. Its main statement echoes his perspectives and theories published 
earlier and later, and the chapters by others frequently echo those perspec- 
tives and theories. It is generally fair, therefore, to treat the report as 
manifesting the Ogburn vision of the social sciences. 

How best to characterize this vision? It is a view that begins with the 
identification of social anomalies and problems that arise through irregular 



2 On this subject, and on Ogbum's conflicts with fellow committee members Wesley Mitchell and 
Charles Merriam on the question of the independence ot (he committee from presidential involvement, 
see Harold Orlans (1982) and Barry D. Karl (1969, 1974). Among the chapter authors. Robert Lynd 
broke most conspicuously from Ogburn by insisting on stressing normative and political issues. 
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social change and ends with the informed amelioration of the anomalies 
and the consequent improvement of society. 

It is possible to produce a graphic representation of what I have extracted 
as the main ingredients of that vision: 

Social -Social -Documenta- -Social -Application -Social 

change (dis- problems tion by ob- invention by policy amelioration 
continuity jective facts change 

and lags) 

Each ingredient leads to the next, and thus constitutes a more or less 
articulate theory of change. In the remainder of my remarks I intend to 
take up each ingredient (as well as the transitions between the ingredients) 
and present a capsule statement of the committee's view, then indicate how 
that view has altered over the decades, mainly as the result of ongoing 
social science research and theory development. 



SOCIAL CHANGE 

One of Ogburn's most notable contributions as a social scientist is the 
notion of "cultural lag," which enjoyed great influence in the social sci- 
ences for a long period and is still important in the literature on social 
change (Ogburn, 1922). The kernel of this theory finds expression early in 
the report itself (p. xiii): 

Not all parts of our organization are changing at the same speed or at the same time. 
Some are rapidly moving forward while others are lagging. These unequal rates of 
change in economic life, in government, in education, in science, and religion, make 
zones of danger and points of tension. 

More particularly, Ogbum saw changes in technology as well as economic 
and governmental organization leading the way of change in modern times, 
with the family and church having declined in social significance. 

The image of society evoked by this notion is what sociologists call "the 
functionalist view," namely, that the different parts of social organization 
stand in systematic — whether harmonious or disharmonious — relationship 
to one another, and that changes in one call for changes in another. This 
view of society, in various forms , dominated a number of the social sciences 
for several decades and still represents a major theoretical position. Sub- 
sequent research and theory development, however, have demonstrated it 
to be both overdrawn and incomplete. Comparative research on the rela- 
tionships between economy and family, for example, have demonstrated 
that even in the face of very rapid industrialization, some traditional family 
forms, far from being "zone? of danger and points of tension," persist and 



ERLC 



34 



THE OGBURN VISION FIFTY YEARS LATER 



even facilitate economic development through recruitment and other mech- 
anisms. The Japanese family is the classic case in point. The implication 
of this kind of research is that the notion of "fit" among the various parts 
of society is weaker than the functionalist view would imply, and that many 
more diverse combinations of structures are possible. A second line of 
criticism and reformulation runs as follows: It is not so much the "fit" or 
"misfit" between different structures that account for pressures for persis- 
tence and change as it is the power positions of groups or classes with 
vested interests and the outcomes of political struggles among these groups. 
This second line of development is seen as exposing and correcting for the 
political naivet6, if not conservatism, of the functionalist position. 



According to the Ogbum vision, social problems emerge as manifesta- 
tions of objective social situations — i.e., discontinuities and lags. For ex- 
ample, the automobile, a material advance, generated an outward drift of 
the population into suburban areas; the coi sequent problem was that the 
central districts were "left to the weaker economic elements and sometimes 
to criminal groups with resultant unsatisfactory social conditions" (Presi- 
dent's Research Committee on Social Trends, 1933:xlii). In another ex- 
ample, the committee attributed increasing divorce rates to the fact that the 
family had fewer economic and other functions, which weakened personal 
ties among its members. 

In the ensuing decades social scientists have become more sophisticated 
in their understanding of what constitutes a social problem. We now see 
that social problems emerge as a complex process of interaction between 
"objective" social conditions, the criteria people bring to bear in evaluating 
those conditions, and the success or failure of efforts of interest groups to 
push their particular criteria forward. Consider another example from the 
report. In their chapter on "The Population of the Nation," Thompson and 
Whelpton brought up the topic of the quality of the population. They argued 
that the differential birthrate among the social classes had resulted in "some 
deterioration in the biological soundness of the national stock" (a social 
problem). Their position on this matter was simply that "as soon as any 
agreement can be reached about the method by which 'undesirables' can 
be selected fror^ the population, they should be prevented from propagat- 
ing" (President's Research Committee on Social Trends, 1933:56). We 
would now regard this view as hopelessly naive. The quality of the pop- 
ulation is not some kind of objectively given problem. It is a problem for 
some (eugenicists) and not a problem for others (the right-to-life movement) 
because the ideological priorities of the two groups — in the name of which 



SOCIAL PROBLEMS 




26 



NEIL J. SMELSER 



problems are identified— are different if not contradictory. Whether the 
quality of population gets officially identified as a social problem calling 
for action depends on the outcome of a political struggle among these and 
other interested groups in society. 

Social problems, then, can be defined by the presence of "objective 
facts" only if there is consensus about the meaning and significance of 
those facts. The Ogburn committee, in regarding social problems as the 
objectively determinable result of objectively observable lags and discon- 
tinuities, was, in effect, imposing a kind of imagined consensus on society. 
That kind of consensus rarely exists. We now know that social problems 
are not matters of objective fact but matters of an uncertain, disputed set 
of both facts and principles. Recognizing this, we can appreciate why such 
a large proportion of the debates about social problems are debates not 
about the existence of facts but about symbols, about the legitimacy of the 
competing sets of criteria by which a factual situation will or will not qualify 
as a genuine social problem. 

DOCUMENTATION BY OBJECTIVE FACTS 

In his introduction to Recent Social Trends, Herbert Hoover spoke of his 
desire "to have a complete, impartial examination of the facts" in the 
report. In a way this phrase encapsulates the mentality of the social sciences 
in the early twentieth century — the acme of positive science, which regarded 
empirical facts as objective things, waiting to be observed, recorded, and 
quantified. This mentality manifested itself in a variety of different ways. 
To name a few: 

• the pioneering efforts to develop measures in psychology and education, 
including the work of Thurstone on measurement of attitudes and Terman 
on the measurement of intelligence. 

• the reaction of the institutional economists (among them Veblen and 
Commons) against what they regarded as the abstract, disembodied theory 
of classical economics; as part of this polemic they insisted on the empirical 
study of economic life in concrete institutions. 

• in anthropology the reaction of the diffusionists (especially Boas) against 
classical evolutionary theory, and their insistence on detailed, empirical studies 
of the movement of cultural items and artifacts from culture to culture. 

• Ogburn's own dismissal of classical evolutionary theory as speculative 
and wrong, 3 and his insistence that the study of evolution must rest on the 



3 Ogburn wrote that the theory of "the inevitable series of stages in the development of social 
institutions has not only not been proven but has been disproven" (Ogburn, 1922:57). 
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"actual facts of early evolution" (Ogburn, 1922:66). Ogbum (1929) cel- 
ebrated the rise of scientific social science in his presidential address to the 
American Sociological Society in 1929, stressing its emphasis on objective 
measurement, verification and truth, and its separation from methods in 
other areas such as ethics, religion, education, and propaganda. 

Not everybody found comfort in this position. Pitirim Sorokin, sociologist 
at Harvard, in a savage review of Recent Social Trends in 1933, bemoaned 
what he called "holy and immaculate quantification": 

In the future some thoughtful investigator will probably write a very illuminating study 
about these "quantitative obsessions" of a great many social scientists, psychologists, 
and educators of the first third of the twentieth century, tell how such a belief became 
a vogue, how social investigators tried to "measure" everything; how thousands of 
papers and research bulletins were filled with tables, figures and coefficients; and how 
thousands of persons never intended for scientific investigation found in measurement 
and computation a substitute for real thought. . . . 4 

Be that as it may, Ogburn's preference for stressing objective facts, apart 
from opinions and value judgments, held sway in the report itself The 
chapters and monographs, the committee said, "present records, not opin- 
ions; such substantial stuff as may serve as a basis for social action, rather 
than recommendations as to the form which action should take" (President's 
Research Committee on Social Trends, 1933.xciv). The contributors, more- 
over, were "bound strictly by the limitations of scientific methods," and 
if they occasionally strayed beyond these limitations the reader could see 
clearly when they were giving their own opinions (p. xcv). 5 

Even at the time, this "factual-statistical" representation of the world 
was regarded by others besides Sorokin as wanting. Adolph Berle, a member 
of Franklin D. Roosevelt's brain trust, commented that the report "has the 
barrenness of . . . statistical measurement ... the desire for objectivity 
has been carried entirely too far." And Charles Beard, the historian, re- 
marked that "the results [of this report] . . . reflect the coming crisis in 
the empirical method to which American social science has long been in 
bondage" (Orlans, 1982:9). And in the decades since the acme of Ogburnian 
positivism we have come to view the world of empirical facts not so much 



4 Thrcughout his review Sorokin assaulted the Ogbum committee report for its multiplication 
of meaningless quantitative tables and citations. In a rejoinder Ogbum countered with the assertion 
that "only one-tenth of the space is taken up with tables," a statement that constitutes a kind of 
ironic confirmation of Sorokin's plaint. 

5 Ogbum wrote a short methodological "note" on the necessity to separate facts and opinions 
sharply from one another, but this was not published as part of Recent Social Trends, probably 
because not all of the members of the committee subscribed to his position (Bulmer, 1983). 
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as a realm of observable and measurable things but rather more as the 
purposeful creation of human agents and investigators. This realization, 
moreover, has resulted from developments both at the level of theory and 
of empirical research. At the theoretical level, early critics of positivism, 
such as Talcott Parsons (1937), argued that facts could not be viewed apart 
from the conceptual framework by which they are evoked. In his influential 
work on the history of science, Thomas Kuhn (1970) argued that both 
scientific facts and scientific knowledge are relative to the kinds of para- 
digms invented and employed by scientists. And more recently critics like 
Jiirgen Habermas have hammered away at exposing the ideological and 
political foundations of "objective science." The cumulative effect of these 
kinds of intellectual development has been to effectively er^' z the positivist 
dream of the early twentieth century. 

At the level of social research our assessment of "facts" has also become 
more sophisticated. The dominant approach, of course, is still that the 
behavioral and social sciences are empirical sciences above all, and we 
have improved our measurement techniques and data bases enormously. 
But social scientists no longer conceive, as a Durkheim or an Ogburn might 
have done, of the crime rate as a "social fact" to be observed. We know, 
on the basis of empirical research, that a "crime rate" is a vastly different 
phenomenon, depending on whether the investigator consults police records, 
observes police in action, asks people whether they have ever been victims 
of crimes, or whether they have ever committed crimes. We know also that 
every one of these measures is defective in different ways. 

We know that there is no such "thing" as public opinion, which can be 
measured scientifically by randomly sampling a portion of the population 
and interviewing them on a given set of issues. Research has shown that 
results of such surveys vary significantly depending on how the questions 
are asked, what kinds of people do the asking (whites or blacks, men or 
women, investigators dressed in suits or investigators dressed in dirty jeans), 
and how people distort their responses on sensitive issues (such as how 
much they smoke, drink, or use drugs) (Cannell and Kahn, 1968). We have 
also come to acknowledge that certain ideological assumptions or biases 
are built into some of the measures we use. For example, the fact that, in 
the sample survey, we give equal weight to all respondents in analyzing 
data reflects a kind of "democratic" assumption that each person's voice 
counts as much as another's — an unrealistic assumption given what we 
know about actual patterns of participation, influence, and power, even in 
democratic societies; it is the (perhaps unwitting) translation of the electoral 
principle of a democracy into a "one-person, one-response" assumption. 

Interestingly, these kinds of acknowledgments make simultaneously for 
both greater humility and greater sophistication on the part of social in- 
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vestigators. We are cognizant of the many sources of measurement error 
that axe generated in the creation and study of social data and in its as- 
sessment by investigators (Turner et aL, 1984). By the same token, how- 
ever, investigators are now equipped systematically to take measurement 
errors into account when representing and statistically manipulating data, 
by using techniques that would not come to mind within a simple posttivistic 
perspective. 



According to the Ogburn vision (President's Research Committee on 
Social Trends, 1933:Ixxi) the massive accumulation and description of 
social facts can reveal the broad range of social problems generated in a 
society undergoing rapid and irregular social change. These problems, 
moreover, ' 'can be solved only by further scientific discoveries and practical 
inventions." 

The imagery of a scientific invention — as well as its application — per- 
vades the Ogburn vision of social reform and the amelioration of social 
problems. In the chapter on "The Influence of Invention and Discovery," 
Ogburn and S. C. Gilfillan wrote that "there are social inventions as well 
as mechanical ones, effective in social change" (p. 162). They gave as 
examples the city manager plan, group insurance, installment selling, the 
passport, and universal suffrage. 

The committee (1933:lxxiv) envisioned the need for a massive effort in 
the field of social invention: 

If one considers the enormous mass of detailed work required to achieve the recent 
decline in American death rates, or to make aviation possible, or to increase per capita 
production in farming, one realizes that the job of solving the social problems here 
outlined is a job for cumulative thinking by many minds over years to come. Discovery 
and invention are themselves social processes made up of countless individual achieve- 
ments. 

Read today, this link between knowledge about social problems and social 
invention appears somewhat mechanical and politically naive. First, little 
attention is given to the exact mechanism that provides the transition be- 
tween the accumulation of knowledge and social invention. In his presi- 
dential address to the American Sociological Society in 1929, Ogburn 
(1929:5-6) outlined a simple model. Science, he said, is an accumulation 
of thousands of verified "bits and pieces of new knowledge. ' 9 He envisioned 
that this would occur through careful, patient, and methodical work, much 
of which could and would be carried out by "dull and uninteresting per- 
sons." Once in a while, "one of these little pieces of new knowledge 
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becomes nf very great significance, and it is then called a great discovery 
or a great invention." Ogburn predicted that when the social sciences 
became truly cumulative, all social scientists would be statisticians, and 
social theory "will have no place in a scientific sociology, for it is not built 
upon sufficient data." 

This account of what constitutes a scientific discovery does not square 
with our more contemporary understanding. We appreciate that the 4 'very 
great significance" of an empirical finding derives from the fact that it 
demands a substantial change in the way we formulate our general under- 
standing of the world — in short, in the way we formulate theory. Typically 
a "discovery" is the verification of findings that cannot be accommodated 
by an accepted scientific framework. Or, alternatively, a "discovery" in- 
volves a reformulation at a theoretical level, such that heretofore unrelated 
empirical findings can be related to one another and explained within a new 
framework or by a new principle. Put another way, scientific discovery 
always involves a relation between empirical findings and theoretical for- 
mulation, not an accumulation of empirical findings (Kuhn, 1970). 6 

Furthermore, with respect to "social inventions" a different set of 
processes needs to be invoked. Consider the social invention of universal 
suffrage— one of Ogburn's examples. It is an invention in the sense that 
it is a contrivance designed to facilitate the operation of the democratic 
process. But the role of knowledge in the crystallization of such an 
invention is a limited one. Much of the "knowledge" involved has not 
been scientific in the sense of having been proven or verified; it has been 
more in the nature of lore associated with democratic philosophies, which 
takes the form of assumptions about the workings of political influence 
and power. Furthermore, the dynamics of the invention were not the 
dynamics of assembling knowledge so much as the historical struggles 
of different kinds of classes and groups for access to the political systems 
of democracies. 

More generally, social inventions appear to be the invocation of estab- 
lished or imputed knowledge in relation to some desirable social goal or 
social value. Consider the historical "invention" of desegregated education 
by the United States Supreme Court in Brown v. Board of Education in 
1954. In that decision, justices cited a wide variety of social-science findings 
to the effect that separate facilities engendei feelings of inferiority in blacks. 



6 For an earlier statement of the relations between empirical findings and theory in the social 
sciences, see Robert K. Merton's two essays, "The Bearing of Sociological Theory on Empirical 
Research" and "The Bearing of Empirical Research on Sociological Theory 0 (Merton, 1968:139- 
171). 
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But as Judge David Bazelon (Eiscnberg, 1969:374) argued, reliance on 
these findings might have misstated the true basis for the case: 

In 1896 the court had approved the 4 * separate but equal" doctrine. While the country 
might then have lacked the sophisticated studies available in 1954, any honest person 
would have conceded at the time of Plessy v. Ferguson that segregation undoubtedly 
would have made Negroes feel inferior. The assumption of inferiority was the rationale 
for the practice; no black man could help but perceive that separate train cars and 
separate schools kept him in his place. 

Since we already knew what Kenneth Clark Hid others told us, the public could justly 
ask of the Supreme Court in 1954, why the law had changed. The answer, of course, 
was that our values had changed. Plessy v. Ferguson was discarded not because social 
scientists told us that segregation contributed to feelings of inferiority, K 'it because by 
1954 enough people in this count* v believed what they did not in 1&96 — that to thus 
insult and emasculate black people was wrong, and intolerable, and therefore, a denial 
of the equal protection of the law to blacks. 

In the area of social inventions, as in other areas, the committee's in- 
sistence on the neutrality of scientific knowledge and on its separation from 
matters of opinion involved a cost. In this case the cost was to miss a great 
part of the intricate interplay between knowledge — whether imputed or 
established — and the political and cultural dynamics of society. 

APPLICATION BY POLICY CHANGE 

Toward the end of its main report, the committee (p. Ixxiii) noted with 
approval the "increasing penetration of social technology into public wel- 
fare work, public health, education, social work and the courts." In ad- 
dition, it called for the formation of groups through the Social Science 
Research Council to bring technical advice to decisionmakers, and perhaps 
the formation of a national advisory council to focus on 44 the basic social 
problems of the nation." 

We have seen, in the discussion immediately preceding, that to invoke 
the imagery of technology in the formation of social policies is both limiting 
and misleading. The same can be said when that imagery is carried over 
to the implementation of social policies. Two observations are in order on 
this score. 

The first has to do with the adequacy of knowledge in the name of which 
policies are implemented. The putative knowledge cited in the Brown v. 
Board of Education case was that integrated school facilities would lead to 
a decrease in feelings of inferiority on the part of blacks. Scores of studies 
on the self-esteem of black children in diverse settings tell us that so many 
contingencies affect self-esteem— class, neighborhood, the behavior of in- 
dividual teachers, the fortunes of the movement to improve conditions for 
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blacks in the larger society, to name a few — that it is impossible to posit 
a single, direct link between type of schooling and the self-esteem of its 
pupils (Smelser and Smelser, 1981). Speaking more generally, most sci- 
entific knowledge of all sorts is organized in the form of contingent 
predictions, that is, connections between variables (such as government 
deficit-spending and r«ie of inflation, or type of educational arrangements 
and self-esteem), with other things held constant. This is the way knowledge 
is generated— by holding various factors constant, whether by experimental 
or statistical manipulation, in order to establish precise causal linkages. But 
in the ongoing flow of social life, other things are not constant, and precise 
prediction of consequences is impossible because of the interaction among 
multiple forces. 

A second complexity arises through the fact that any kind of policy, 
when implemented, is likely to generate a variety of unanticipated side 
effects, not all of which are predictable or likely to be beneficial. Consider 
only one example, that of attempting to ameliorate the incidence of suicide 
in society. One feasible policy would be to attack intensively the social 
conditions of certain high-risk groups, such as the elderly, with the aim of 
reducing feelings of isolation, desertion, and despair. In implementing this 
kind of policy, a community might embark on a program of establishing 
senior citizen clubs as social centers, and making individual agencies, such 
as suicide prevention centers , more available to them. Integrating the elderly 
into more meaningful social communities might decrease the incidence of 
suicide. But in addition, it might facilitate the formation of more definite 
political groups among the elderly, which are traditionally antipathetic to 
educational programs that call for the passing of school bonds, as well as 
to community health programs such as the fluoridation of drinking water — 
to programs, that is. that represent the implementation of other social goals, 
usually considered also worthy by the planners sponsoring the suicide- 
prevention efforts. Knowledge of the diversity of consequences of different 
programs may in fact result in more intelligent setting of priorities in plan- 
ning. In any event, it provides a different and better model for planning 
than that of the direct application of bits of knowledge toward the solution 
of specific problems. 

SOCIAL AMELIORATION 

The last link in the chain of social process is the ultimate impact of 
knowledge on society's welfare. As indicated earlier, the committee (pp. xlii- 
xliii) was apprehensive about the trend toward higher divorce rates in 
American society; "our culture may be conducive to further increases in 
divorce unless programs are instituted to counteract this tendency." The 
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problem arising for society is "how ... to make marriage and the family 
meet more adequately the personality needs and aspirations of men and 
women and children." And in pointing the way to dealing with such a 
problem, the committee once again turned to the importance of knowledge: 
"the study of marriage and divorce may not only aid in stabilizing the 
family but may also help us on the road to happiness." 

My comments up to this point should indicate how many unstated, un- 
acknowledged, and contingent steps there are between the objective study 
of a social state of affairs and its improvement. But it should also be pointed 
out that "happiness" or improvement as a consequence of purposive plan- 
ning and programs is itself a contingent matter. Just as the O^burnian vision 
of what constitutes a social problem rests on the committee's imagined 
consensus on values, so does its notion of amelioration. In areas where 
widespread consensus on values obtains in society — for example, the health 
of the population — programs like mass immunization are likely to be un- 
controversial and widely regarded as ameliorative. When, however, such 
consensus is lacking, one group's amelioration is another group's deteri- 
oration. Even the Ogburn committee's invocation of the value of "family 
stability" as a consensual matter could be and has been challenged by those 
committed to communal and other arrangements believed to be superior to 
the traditional family. When consensus is lacking, moreover, debate comes 
to focus not only on the consequences of programs but on the relative 
legitimacy of the competing cultural values by which we judge those con- 
sequences. In this respect, the assessment of consequences is as deeply 
embedded in the political and cultural dynamics of a society as is the 
identification of social problems. 



We end with a kind of paradox. Even though the Ogburn report seeks 
legitimacy mainly from the framework of positive science, its vision of the 
social process is characterized by a number of items of faith: faith in the 
capacity of objective knowledge to identify social problems, faith in the 
capacity of cumulative knowledge to result in social inventions, and faith 
in the capacity of those inventions to solve the social problems. That par- 
ticular set of faiths permitted the committee to be simultaneously naive and 
pretentious — at least as judged by our contemporary understanding — about 
the role of the behavioral and social sciences in social policy. The same 
set of faiths permitted the committee to define social and behavioral sci- 
entists as simultaneously disembodied from the political process and es- 
sential ingredients to that process. Such are the paradoxical consequences 
of the positivist-utilitarian view of the relations between science and society. 
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Today I believe we would acknowledge the tremendous importance and 
utility of the social sciences in the social and political life of the nation. In 
its first report (Adams et al., 1982), the Committee on Basic Research in 
the Behavioral and Social Sciences acknowledged this and pointed to three 
areas in particular technical contributions in the information-generating 
process, such as sample surveys and standardized testing; changes in the 
way we do things, such as administer therapy, predict economic trends, 
and run organizations; and changes in the way we think about things such 
as poverty, race, social justice, and equity in society. Yet the present 
committee, mindful of the kinds of complexities and contingencies that 
have been touched upon in this discussion, regarded these not as utilitarian 
applications of bits of scientific knowledge, but rather as arising from and 
intertwined with the social purposes and cultural aspirations of the nation 
as a whole. As a result of change in our thinking about the relations between 
science and society, I believe we have become, paradoxically, both more 
sophisticated in our research design and measures and less pretentious in 
our aspirations than we were 50 years a^o. 
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INTRODUCTION 

Surely among the most influential models of social change was that 
developed by William Fielding Ogburn ( 1922b). Ogburn described a process 
of invention followed by cultural change, followed by social disorganiza- 
tion, and finally social adjustment (Ogburn & Nimkoff, 1940:877). Ogburn 
concluded that public policies and interventions meant to guide modern 
social change would depend heavily upon the development of a unified 
national statistical system to collect and process information about social 
trends (Ogburn, 1929:958). Although Ogjwrn's vision of a unified statistical 
system has not been realized, he may well have regarded this as but a lag 
in adjustment to which all indentions give rise. 

This essay does not attempt to assess systematically Ogburn's (1922b) 
theory of social change, his contributions to our understanding of social 
trends (1928-1935, 1942), or the development of statistical systems (Og- 
burn, 1919; President's Research Committee on Social Trends, 1933). But 
it draws heavily upon that vital heritage. Three major questions are addressed: 
(1) How do inventions, especially those of the behavioral and social sci- 
ences, affect social changes and adaptations? (2) How do social changes 
affect measurement? And, (3) How do contemporary behavioral and social 
science models, concepts, anc methods affect our understanding of society 
and how it changes? 

SOCIAL INVENTIONS 

In Ogburn's view, in sntions, particularly mechanical ones, are the source 
of all cultural growth and evolution. Inventions also cause disruptions in 
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related parts of culture and in social organization, necessitating adaptations 
and adjustments. But these adjustments take time, and Ogburn therefore 
called them cultural lags, noting that "Over the long course of social 
evolution, measured in thousands of years, cultural lags are invisible. At 
any particular moment, however, they may be numerous and acute* 9 (Og- 
burn in Duncan, 1964:30). 

Although Ogburn emphasized that social inventions can cause social 
change (1934: 162), his theory and his own work gave priority to mechanical 
inventions (1922b:76-77; Ogburn & Nimkoff, 1940:809-8 10). 1 This be- 
nign neglect of social inventions is coupled with Ogburn's firm conviction 
that the behavioral and social sciences can shorten cultural lags. Nowhere 
did he summarize this belief better than in his chapter on invention in Recent 
Social Trends (President's Research Committee on Social Trends, 1933:166): 

Society will hardly decide to discourage science and invention, for these have added 
knowledge and have brought material welfare. And as to the difficulties and problems 
they create, the solution would seem to lie not so much in discouraging natural science 
as in encouraging social science. The problem of the better adaptation of society to its 
large and changing material culture and the problem of lessening the delay in this 
adjustment are cardinal problems for social science. 

Ogburn concluded an essay on trends in social science with these obser- 
vations (1934:262): 

The greatest obstacles to the development of science in the social field are complexity 
of the factors and the distorting influence of bias. These are formidable, but certainly 
the trends of the present century are most encouraging, and we may look forward, 
because of social science, to a greater control by man of his social environment. 

The relatively lesser emphasis that Ogburn placed on the role of social 
as compared with material technology persists to this day. Even social and 
behavioral scientists tend to oveilook their role in processes of social change. 
In fact, it is quite plausible that social inventions, especially those of the 
behavioral and social sciences, are a major cause of change, as well as key 
elements in society's adaptation to change. The selective perception that 
limits recognition of the role of behavioral and social science inventions 
may indeed count as a cultural lag. 



Ogburn's interest in social inventions, their effects, and lags jn adapting to them preceded the 
writing and publication of his classic study, Social Change (1922b). His doctoral dissertation 
(1912) was on child-labor legislation. While teaching at Reed College in Oregon, he became 
interested in the initiative and referendum as methods of direct legislation (1914, 1915). Still later, 
he was interested in the consequences of women's suffrage (Ogbum & Goltra, 1919). As Duncan 
concludes, however, this early interest in social invention? arose, in part, from political sympathies 
with social problems and reforms. 
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Underlying the major themes for this first section is a speculation that 
the relative contributions of the respective sciences and technologies to 
social change are altering substantially. Modern societies have come to 
depend heavily on the behavioral and social sciences and their technologies 
and cannot run without them. As material technology replaces labor, non- 
material technology may come to dominate social change, if it has not 
already done so. 

Major Social Inventions and Their Consequences 

Ogbum was fascinated by the effect of what he distinguished as major 
technological inventions such as the ship, the airplane, the internal com- 
bustion engine, and the elevator. He also devised lists of significant social 
inventions (1934:162), such as the minimum wage law, the juvenile court, 
Esperanto, installment selling, and group insurance. Yet he apparently never 
attempted to differentiate between social and behavioral inventions with 
potentially major versus those with more limited or minor effects. Some 
social and behavioral science inventions, nonetheless, have had such sig- 
nificant and widespread impact that one cannot imagine modern democratic 
societies operating without them. Two such inventions, noted in the first 
report of the Committee on Basic Research in the Behavioral and Social 
Sciences (Adams et al., 1982) are singled out here: human testing and 
sample surveys. 

Human Testing Ogburn (1950) generally attributed invention to three 
fundamental causes: mental ability, social demand, and the accumulation 
of cultural elements from which inventions are fashioned. To pinpoint the 
origins of a particular invention is not a simple task, given the multiplicity 
of able minds, the variation in the sources of demand, and the different 
patterns that elements of the cultural base may take. 

The invention of human testing is usually attributed to a nineteenth- 
century scientific interest in the study of individual differences. The history 
of tests of distinctly mental abilities is better documented than other major 
forms of human testing (Wigdor & Garner, 1982). Tests of mental abilities 
derived from psychologists' attempts to understand differences in intelli- 
gence among individuals. Galton (1869) first devised a series of sensory 
discrimination tests to shed light on individual differences, followed by 
Cattell (1890) and others who developed batteries to test sensory and motor 
abilities. But it was a demand within the French Ministry of Education, to 
distinguish subnormal from normal children in Paris schools, that led Binet, 
in collaboration with Simon (1905), to introduce the concept of mental age 
and scales to measure it. 
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Ogburn often noted that inventions diffuse more readily where there is 
a demand for them; the Binet-Simon scale diffused quickly. The test was 
translated into English by Goddard in the United States in 1908, into Italian 
by Ferrari in 1908, and into German by Bobetrag in 1912 (Klineberg, 
1933:323). Translation was followed by revision, such as the Stanford- 
Binet test published by Terman and his collaborators in 1916 (Klineberg, 
1933:324). 

Although testing has been important to the conduct of research and was 
a product of psychological laboratories, its development and invention have 
been highly responsive to social demands arising outside the laboratory, 
initially by the public schools to sort children and somewhat later by the 
U.S. Army to screen World War I draftees. Testing is now at least as 
consequential for the major operating organizations in industrial societies 
as for the conduct of research. The testing industry is integral to four major 
organizational tasks: (1) selection of persons as employees or clients; 

(2) classification of employees or clients according to organizational tasks; 

(3) assessment of human performance within organizations; and (4) assessment 
of the "human output** of organizations. 

Ogburn distinguished primary from derivative effects of invention. Since 
societies and their organizations do not systematically collect and process 
information about such effects, even less so for social than mechanical 
inventions, it is far easier to identify qualitatively than to document the 
quantitative impact of the invention of human testing. The primary effects 
are clearly on employment and the management of organizations. Testing 
occupations generate substantial employment in the U.S. Civil Service, the 
Armed Forces, public and private school systems, and in large private 
industrial firms, most of which employ testing extensively in at least one 
of the four organizational tasks mentioned above, as well as in the devel- 
opment, production, and marketing of rats themselves. 

Public controversy and litigation may surround the use of testing in 
organizational management. Because many organizations base selection and 
promotion on testing, test information can be influential in legal proceed- 
ings. The testing industry has been challenged to produce different kinds 
of tests as a consequence of such litigation. The courts have played a 
substantial role, for example, in structuring tests for selecting and promoting 
women and minorities in police and fire departments. 

Derivative effects of behavioral and social inventions include the spur 
they often provide to mechanical inventions. The first high-speed printer 
(essential for modern computers) was developed for a scoring machine by 
the educational tester Lindquist. In the highly competitive educational 
achievement testing industry, the rapid scoring and delivery of test results 
to schools was critical to market shares. As this example illustrates, social 



ERLC 



43 



40 



ALBERT J. REISS, JR. 



invention and mechanical invention are seldom independent of one another. 
The design of modern control systems necessarily involves both human 
performance measures and technological components. The displacement of 
humans by computerized robots is also a replacement of some human skills 
by other human skills . The machine's displacement of manual or mechanical 
labor moves the labor force toward the cognitive skills that are most dis- 
tinctively human. 

It seems no exaggeration to estimate that the average person in an in- 
dustrial society encounters the products of the testing industry virtually 
every year for the first two decades of life and in many cases for much of 
his or her career. Even where not subject to standardized tests, occupational 
life is controlled by elementary concepts of ability and achievement de- 
veloped in testing. Increasingly, testing concepts enter the debate over major 
issues in society, such as the recent controversy over merit pay for teachers — 
especially whether merit can be based on testing teacher performance. 

Aside from the considerable effect on every other sector of society, the 
invention of testing precipitated many new inventions in statistics and other 
behavioral and social sciences. These inventions have significantly affected 
the conduct of research, and the results of that research have in turn affected 
society. The early testing of intelligence and mental abilities led to Spear- 
man's attention to the reliability of measures and his positing of the G 
factor in intelligent (Spearman, 1904); this development gave rise to factor 
analysis, especially with Holzinger's (1930, 1931) development of the bi- 
factor method (through a study with K. Pearson and collaboration with 
Spearman, 1925). A variety of statistical factoring methods were soon 
invented as the concept of intelligence changed with empirical testing, 
including multiple-factor methods (Thurstone, 1931, 1935) and principle 
component methods (Kelley, 1928, 1935; and Hotelling, 1933). As factor 
analysis was extended to other human traits and characteristics, e.g., human 
emotions (Burt, 1915, 1939), attitudes, and opinions, awareness of its 
limitations led to statistical inventions for discerning latent structures (Gun- 
man, 1950; Lazarsfeld, 1950, 1954, 1967;Rasch, 1968, 1980) and statis- 
tical interactions (Goodman, 1970). 2 These analytical innovations have 
shaped theory and hypothesis testing in behavioral and social sciences and, 



2 The history of social science inventions should become an important part of any sociology of 
knowledge as well as being integral to the study of social change The ways that demand shapes 
intellectual agendas is not well understood. Consider the fact that Lazarsfeld undertook his work 
on latent structure analysis and Gunman on scale analysis in connection with research for the 
Research Branch of the Information and Education Division of the U.S. War Department in World 
War II. 
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as Holzinger noted in 1 941, have had major applications in physics, med- 
icine, and business forecasting (1941:5). 

Sample Surveys Modern sample surveys rest on early inventions. The 
principles of random selection, objective probability, and stratified random 
sampling are well over a thousand years old (Dunca*, I984:iv). Survey 
modes of data collection also have been around for a considerable time. 
But the coalescence and systematization of these inventions into the modern 
stratified probability survey of a population are a product of modern be- 
havioral and social science, coming mostly within the last 50 years. 

As in the case of testing, there is a dearth of data to assess the effects 
of this invention, particularly its role in social change. Yet, we can plausibly 
argue that, except for institutional data collected as a by-product of orga- 
nizational routines, the sample survey has become the major mode for 
linking action to intelligence in modern democratic societies. Even news 
organizations do not any longer claim to speak for the aggregate except in 
a metaphorical sense; but the opinion poll is accepted as doing so. 

It is difficult to trace all of the ways that the sample survey has come to 
dominate organizational and individual decisions and operations. A few 
examples are offered simply to illustrate how pervasive it has become and 
how instrumental it is in changing behavior. 

Perhaps nowhere has the invention of sample surveys altered the pattern 
of activity as mu^i as in American electoral politics. Despite an abundance 
of skepticism about candidate and opinion polls, no candidate runs for 
major political office without a private polling operation. Media coverage 
of elections compares candidates in terms of their poll status; legislative 
and executive action is responsive to poll information; and political issue 
and candidate polls are a substantial American industry. 

A second major area where surveys dominate is in providing intelligence 
for government decisionmaking. Much of the information for operating the 
government comes from sample surveys. The IRS, for example, has used 
sample surveys in its Audit Control Programs since 1948, and as an estab- 
lished part of its Taxpayer Compliance Measurement Program (TCMP) 
since 1962 (Long, 1980:55). These surveys of tax returns and filing com- 
pliance in the general population have become a principal means for the 
IRS to set its enforcement strategy. Major short-term policy indicators on 
unemployment and the cost of living are based wholly or in part upon 
sample surveys. The Survey Division of the Bureau of the Census has 
become one of its largest, quite apart from many other divisions within the 
bureau also operating sample surveys or collecting information through 
them. The Current Population Survey annually reaches about 1 in 1,000 
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households. No organization of any size remains unsurveyed by some gov- 
ernment organization (though not always by sample surveys). 

A third major area for sample surveys is marketing. Market research may 
be the dominant sector in sample surveying, surpassing the resources al- 
located to surveys by governments — though data for precise comparisons 
are lacking. 

There are several kinds of market research. Sample surveys affect product 
development and sales strategies. They locate territories or populations for 
marketing a particular good or service. Surveys estimate the demand for 
new products or satisfaction with existing ones. The mass media, which 
rely on sample surveys for news, rely even more heavily on them for market 
information. No industry is more sensitive to the sample survey than tele- 
vision, in which ratings of network programs determine advertising revenues 
and the fate of writers, producers, and stars. 

As a fourth major consequence, the sample survey has become the major 
means of developing social indicators in postindustrial society. Sample 
survey information is aggregated into indicators in two different, albeit 
related, ways. Surveys are used cross-sectionally — at a point in time — to 
evaluate relative performances or outputs, as in the Nielsen ratings of 
television programs, or to compare electoral candidate strengths. Social 
indicators are also used to forecast, monitor, control, or respond to the 
course of change over time. For example, the monthly Current Population 
Survey estimates unemployment, residential tenure, and vacancy rates; the 
semiannual National Crime Survey examines victimization rates; the Annual 
Housing Survey reports characteristics of housing units; and the National 
Health Survey examines illness, use of health care services, and health- 
related expenditures. 

Sample surveys are also important in applied social science research, 
especially by nonacademic organizations. Not only has evaluation research 
become a substantial private industry, but major organizations such as the 
Armed Forces have developed a considerable in-house capability for sample 
surveys; it has been said that the most surveyed population in the world is 
the Armed Forces of the United States; certainly the American soldier in 
World War II served the most surveyed military in history (Stoufferet al., 
1950). 

Finally, the sample survey is one of the major methodological foundations 
of the modern behavioral and social sciences. Despite widespread use in 
government and by profit and nonprofit organizations, major innovations 
and inventions in sample surveying continue to stem mainly from the ac- 
ademic social science community. Exceptions occur, primarily in the de- 
velopment of efficient means of surveying, such as computer-assisted telephone 
interviewing (CATI); yet even when such innovations occur outside the 
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academic community, assessment of their utility and continuing innovation 
generally moves within it. 

/his brief review of the pervasive effects of two major behavioral and 
social science inventions — human testing and sample surveys — illustrates 
their major impact on patterns of life in modern societies and draws attention 
to the possibility that the relatively lower scientific prestige of the behavioral 
and social sciences rests in part on their not studying the social impact of 
their inventions. 

Were there systematic investigations of such inventions and their effects, 
we might discover ti.at in postindustrial society behavioral and social science 
inventions are more consequential for social change than material inven- 
tions. Ogburn developed his theory of cultural evolution by focusing on 
the material inventions and advances in physical science and mathematics 
that contributed to the Industrial Revolution. That view scanted the great 
social inventions of earlier societies, such as bureaucratic administration 
and empires (Eisenstadt, 1963) and antedated most of modem behavioral 
and social science. 3 The role of economics in setting government policies 
and in the social control of economies has grown considerably since the 
work in Recent Social Trends. Although a president had sought the advice 
of academic social science in the "President's Research Committee on 
Social Trends," the committee seemed not to have imagined the significant 
role that behavioral and social science inventions would come to play in 
corporate organizational life and government in America. 

Ogburn believed that the cultural base of social invention accumulated less 
rapidly in modern times than that of mechanical invention (Ogburn & Nimkoff , 
1940:792). 4 This slower growth, in turn, slows the rate of new social invention. 
Yet there appears to be greater accumulation in the behavioral and social 
sciences than Ogburn expected. Rapid expansion of the knowledge base has 
been especially evident in cognitive psychology and linguistics. 

A final word may be in order here on the reluctance to examine the 
impact of behavioral and social science inventions on society and especially 
on social change. Lags in adaptation due to such inventions may be intrin- 



3 Ogbum observes en passant: "The fact that technology is at present so powerful a cause of 
cultural lags v and consequent social disorganization, does not deny that other variables such as 
social inventions or population changes are creating lags also ... the lag of social changes behind 
technological progress is simply a special case of the general phenomenon of unequal rates of 
change of the correlated parts of culture** (Ogburn & Nimkoff, 1940:893). 

4 The matter is empirical. It is not clear that the cultural base of social inventions cumulates any 
less rapidly in the modem world. Boulding (1978) argues that the homogenization of societies 
throughout the world n ay lead to less diversity in the cultural base and thus in the long run threaten 
the survival of culture. 
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sically shorter than for material inventions. But also, the dominant social 
theories have conceptualized dOcieJes as relatively stable structures, with 
an emphasis on the ways that such stable structures are maintained. 5 Models 
of social structural change seem less well developed, less often tested, and 
more focused on radical or revolutionary change than on ordered but ac- 
celerated change. 6 The literature on organizations, for example, emphasizes 
the resistance that organizations display to deliberately contrived interven- 
tions. This sirategy of theory construction and testing downplays the im- 
portant ways that inventions occur and are diffused in society — most often 
other than by deliberate intervention — and promotes the false premise that 
invention and intervention are ordinarily successful in producing change, 
except where organizational resistance is powerful enough. The contrary 
seems to be the case. Most experiments and inventions fail, or succeed in 
producing entirely unintended effects. We may learn more about how to 
produce intended effects through social invention by looking to the unin- 
tended consequences of purposive social action (Merton, 1936). 

Reduction of Cultural Lags 

Although Ogburn subordinated the roie of behavioral and social science 
inventions in causing cultural change, 7 he assigned to these sciences a special 
role in facilitating the adaptation of society to changing material culture 
(1934:166). Ogburn believed that the failure of institutions to adapt to ad- 
vancing technology produced nearly all social maladjustment and disorgani- 



5 Ogt>um (1957b:8-9) concluded that the study of social trends carries two major message*: 
"The first genera] message that knowledge of social trends brings to us is that there is much 
stability in society, even though there be a period of great and rapid social change. . . . The second 
lesson wc learn from a knowledge of social trends is th~ there is a so:t of inevitability about social 
trends. ... It is difficult to buck a social trend. It may be slowed up a bit, but generally a social 
trend continues its course. . . . Success is more likely to come 'o those who work for and with a 
social trend than to those who work against it." 

6 Antipathy toward military institutions, for example, may account for a general neglect of how 
organizations may change quite rapidly and as a consequence of social inventions. In the history 
of race relations in the United States, for example, little attention is ^iven to how the U.S. military 
organizations became egalitarian and at an accelerated rate compared with any other sector of 
American society (and that religious organizations are amoiig the most recalcitrant to change and 
racially segregated at the local level). 

7 In Part VII, "Social Change," of Sociology, Ogburn recognized that assigning a priority to 
mechanical invention is partly a function of the precision with which an invention can be dated. 
He also recognized the problem of an infinite regress of causation that complicates assignment of 
priority in social change. He concluded with a mechanical am logy: "When all the interconnected 
parts of a culture are in motion, and each part exerts a force on some other part, the origin of the 
motion cannot be located" (Ogbum & Nimkoff, 1940:866-867). 
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zation (Ogburn & Nimkoff, 1940:890). In a 1957 addendum to the theory of 
cultural lags, Ogburn (1957a: 172) reasoned that lags accumulated more rapidly 
in modem society because of the volume and accelerated rate of technological 
change. Although acknowledging that lags might be reduced by retarding the 
development of the natural sciences or following Stamp's (1937) suggestion 
for a moratorium on mechanical invention, he did not take these suggestions 
seriously, believing that such courses of action required too high a degree of 
planning and control (Ogburn & Nimkoff, 1940:890). Although the accu- 
mulation of lags was thus inevitable, it could still be reduced. For example, 
wars and revolutions reduce accumulated lags in a society (Ogburn, 1957a:172). 
Another less rad'cal way to reduce lags is thro ? ^h the technology of the 
behavioral and social sciences (President's Research Committee on Social 
Trends, 1933:166). But just how to achieve this Ogburn failed to make clear. 

The answer would have to lie in the production of knowledge-based 
innovation and invention designed to increase adaptation to cultural changes 
or to reduce the effects of their accumulation. 

Below I will illustrate two different ways in which social science — both 
basic ani applied — can function in restructuring societies in consequence 
of changes in culture. 

Statistics and Quality Control The invention and diffusion of statical 
quality control illustrates how social inventions can cope with the cultural 
dislocations caused by material and nonmaterial inventions . The coalescence 
of mechanical inventions into the modern mass production assembly-line 
factory produced the problem of assuring uniformity and high precision. 
Departures from strict production standards have consequences ranging from 
mechanical failure to increased transaction costs; these can be very signif- 
icant in competitive nrrkets or under other conditions where the tolerance 
for failure is small. 

Statistical quality control is the statistical surveillance of repetitive processes. 
Ji is used primarily for two purposes: process control to evaluate future per- 
formance and acceptance inspection to evaluate past peifomiance (Wallis & 
Roberts, 1956:495). In either type of control, samples are drawn to make 
decisions about a population. For process control, the population is an infinite 
number of exoected results from repetitions of the same process; for acceptance 
inspection, i is the quality of a finite set of existing items. 

The basic invention of statistical quality control was developed in the 
1920s by an industrial statesman, Shewhart, 8 who invented the statistical 
quality control chart (1925. 1926a, 1926b, 1927, 1930, 1931). Its wide- 



Shcwhart dates the invention of the statistical quality control chart as 1924 (1939.4). 
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spread dissemination came in the 1940s and resulted from the demands of 
the War Production Board, which deemed quality production of military 
goods essential to winning the Second World War, especially in light of 
the high quality of the German industrial complex (Wallis & Roberts, 
1956:495, 512). Wald's method of sequential analysis (1945), although 
developed initially for use in scientific research, proved so useful for ac- 
ceptance inspection that an estimated 6,000 U.S. plants used it within two 
years of its development in 1943 (Wallis & Roberts, 1956:518). 

Othe: organizational innovation accompanied this rapid diffusion. Inten- 
sive training courses in quality control were developed at Stanford Uni- 
versity and given in most major industrial centers during the war. Among 
the many consequences of diffusion was the founding of the American 
Society for Quality Control, made up largely of applied statisticians working 
in industrial applications. 9 

Ogburn concluded from his studies that the acceptance of inventions and 
their integration into cultures other than the one of origin depended upon 
the similarity of the cultures involved (Ogburn & Nimkoff, 1940:829). He 
was also disinclined to assign causal roles to individuals either in invention 
or diffusion (Ogburn, 1926). For Ogburn, the existence of independent 
invention demonstrated that the cultural base predominates over individual 
ability or uniqueness. 

Ogburn's view may be correct in the long run, but in the short-run case 
of quality control, there v/ere key individual disseminators. One of these 
was W. Edwards Deming, a government statistician originally in the De- 
partment of Agriculture and later at the Bureau of the Census and on 
independent government assignment. The introduction and rapid diffusion 
of statistical quality control in Japan seems largely due to the efforts of 
Deming. Since 1951, the Union of Japanese Scientists and Engineers has 
recognized his importance to Japanese industry by creating a major award, 
the Deming Prize, for contributions to statistical quality control in industry 
(American Statistical Association, 1983:1). 10 Some believe that the com- 
petitive margin of Japanese over U.S. products is attributable to a higher 
integration of statistical quality control in Japanese industry. 



9 Although statistical quality control was initially developed and applied in industry, the invention 
has wide applications since it is applicable to any kind of repetitive process, e.g., communicable 
diseases, medical experiments with human subjects, and accounting processes. 

10 There is no Deming Prize in the U.S., although he was honored in 1983 by the American 
Statistical Association for his contributions to "statistical quality control at home and abroad" 
with the Samuel S. Wilks Medal Award. Deming also has been decorated for his work in the 
name of the Emperor of Japan with the Second Order Medal of the Sacred Treasure. Nearinr; :ge 
83, the peripatetic Deming was absent from the award ceremony, unable to fit it into his schedule 
without a few months* notice! (American Statistical Association, 1983*1). 
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Cohort Analysis A second example of how behavioral and social sci- 
ences permit adaptation to social change is the use of cohort analysis. A 
cohort is an aggregate of individuals of similar age who are exposed to or 
experience certain events during the same period of time. Cohort analysis 
is a quantitative description and analysis of occurrences from the time a 
cohort is exposed to these events (Ryder, 1968:546). 

The continued entry of new cohorts provides a continuing opportunity 
to modify society. Cohorts consequently are central to the study of social 
change. But there also may be effects associated with age or aging per se, 
and changes brought abaMl4>y external influences or events that affect all 
people alive at the time. These three sources of change in a population are 
referred to as cohort, age, and period effects. 

A cohort analysis, as Ryder (1968:550) points out, differs from a lon- 
gitudinal or panel analysis in that the latter examine changes in the individual 
members of a population or sample over time, while cohort analysis ex- 
amines the changing characteristics of an aggregate through time: it is 
macro- rather than microlongitudinal. 

The value of a cohort analysis to our understanding of social change can 
be illustrated by the studies of changing attitudes toward racial integration in 
the United States (Taylor et al., 1978:48). Opinion polls between the 1950s 
and 1980 showed considerable shift in white attitudes favoring racial integra- 
tion. Underlying that shift, however, were different cohort trends. Although 
all age groups showed some shift with aging, this factor accounted for only 
about 10 percent of the total attitude change. Almost half of all change was 
due to the succession of cohorts in the population, with older, less favorable 
cohorts being replaced by new, more favorable ones. Almost half of the change 
in favorableness by 1980 is due simply to those younger cohorts comprising 
an ever greater portion of the population. By simple extrapolation we would 
forecast Jiat within a matter of decades the vast majority of the population 
will favor racial integration. This type of cohort analysis shows that lag re- 
ductions often occur through the mechanism of population replacement. 11 



But cohort analysis does not substitute for theoretical models of what causes particular changes. 
2n the example, we still need to explain why the younger cohorts are mos*. favorable. Is it due, 
for example, to indoctrination, to greater contact with unlike persons in environments such as 
schools, to involvement in social movements that support certain racial attitudes, or to some 
combination of these and other explanatory variables? While cohort analysis can aid us in under- 
standing changes at the population level, it does not provide a substantive theoretical explanation 
of how such changes occur at the macrole v cl of individual members of that population or at the 
microhistitutional and organizational level of changes. The failure to develop explanatory micro- 
and macromodcls of social change severely limits our understanding of it. For a more extended 
discussion and set of examples of uses of cohort analysis, see Reiss (1982b). 
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sions by estimating how cohort succession can be expected to affect the 
market for old and new products. 

CONSEQUENCES OF SOCIAL CHANGE FOR 
MEASURES AND MEASUREMENT 

Concepts and Measures as Products of Social Life 

The social process itself is the source of most basic concepts and pro- 
cedures of social measurement (Duncan, 1984:ii). All ways of knowing are 
socially organized (Biderman & Reiss, 1967), and all methods of inquiry 
into social life are subject to its substantive laws even as they attempt to 
discover and test those laws (Reiss, 1980). 

Duncan's recent work on social measurement (1984) draws attention to 
the fact that many of our basic concepts and procedures of social mea- 
surement such as voting, counting people, money, social rank, rewards and 
punishments, randomization, and sampling did not originate in the pursuit 
of scientific knowledge but rather as the consequence of practical problem 
solving. Not only do we depend upon social processes to invent many of 
our concepts and measures, but the development and maintenance of a 
social science depend in the long run, as Duncan notes, upon "what the 
society wants or allows to be measured and is able and willing to pay for. 
How it will be measured — or, in any event, the socially tolerable limits on 
concepts and measurement — is also socially determined." (Of couree, phy- 
sicists and astronomers who strugf'e for appropriations to finance massive 
particle accelerators or space science vehicles face similar constraints.) 

Not the least of purposes in acquiring information for statistical indicators 
is to gain support for particular courses of action. Florence Nightingale's 
(1858) work on the collecting and processing of statistics to monitor sanitary 
conditions in Briish army life illustrates how both the origin and institu- 
tionalization of statistical indicators depend upon social purposes and pro- 
cesses. Nightingale collected information on deaths due to preventable 
causes in British army hospitals at home and in the Crimean War and 
invented statistical graphics to depict that information as part of her cam- 
paign to improve the sanitary condition of army hospitals . She even authored 
an anonymous publication, Mortality of the British Army, to mobilize public 
and parliamentary attention to her diagrams. In private correspondence she 
referred to these unique graphics as coxcombs because of their shape and 
the red and blue colors she used (Cook, 1914:376; Funkhouser, 1937:344). 
Nightingale discoursed on the "inaccuracy of hospital statistics" and (he 
want in General Military Hospitals of a Statistical Department engaged in 
registration" (1858:288-332) and succeeded in securing such departments. 
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may be no more litigious — perhaps even less so — in matters that could 
always have been litigated. Stated more technically, measures of validity 
and reliability are themselves grounded in postulates that compromise the 
measurement of change. 

Social Change and the Organization of Ways of Knowing The mea- 
surement of social life and changes in it depends upon socially organized 
ways of knowing. The concept of a "real" or "true" rate of crime, for 
example, independent of organized efforts to detect and measure crime, is 
illusory (Biderman & Reiss, 1967). There are no rates without an organized 
intelligence system to demand, collect, and process people's accounts, 
whether the people are scientific observers, police officers, victims, or 
jurists. More generally, there are only socially organized ways of knowing 
because all criteria (and measures) for knowing, defining, and processing 
social facts lie in social organization (Biderman & Reiss, 1967:9). 

Ogburn conceptualized the organization of ways of knowing as a social 
invention, and indeed our understanding is enhanced if we see knowledge 
as the product of invertic. Many small inventions, for example, went into 
what we think of as the modem sample survey: not only inventions such 
as statistical probability, sampling, and analytical techniques, but organi- 
zational procedures to train and supervise people in acquiring information 
in interviews and to link these people togeiher in a sequential process. 

These observations have a number of implications for the study of social 
change. One is that we must understand how these socially organized ways 
of knowing change over time as a consequence of material and r.onmaterial 
technology and the effect that such changes have on our concepts and 
measures. When one moves from in-person to phone interviewing and from 
an interview schedule to a computer-assisted recording, one cannot simply 
assume that these will produce the same results, for each is enmeshed in 
a somewhat different social modality. 

The second implication is that more attention should be paid to how 
changes in organization affect the definitions of measures and to under- 
standing how information reflects the character of its collection, processing, 
and reporting. We have come to realize that processes such as illegal 
immigration affect organized ways of knowing such as censusing. Due to 
the unequal residential distribution of illegal immigrants, the resulting "un- 
dercount" of the population can affect state representation on the House 
side of Congress. The magnitude of the undercount can be reduced by other 
socially organized ways of knowing, including multiple record systems and 
new statistics 1 methods of estimation (Fienberg, 1972; El-Koratzy et al., 
1977; Ericksen & Kadane, 1983; Levine et al., 1985). But few scientists 
or others appear to realize that the very concept of an undercount is rooted 
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in socially organized and epistemologically dubious assumptions — the no- 
tion, for example, that a method exists whereby a large, mobile, dynamic 
population can be exactly counted or characterized at a fixed point in time 
by enumerating everyone who resides at a fixed location at that point in 
time. 

The Paradox of Method All methods for inquiry into nature are gov- 
erned by the substantive laws of nature at the same time that these methods 
are the means for discovering and testing those laws (Reiss, 1980:1). This 
paradox can be resolved only by approximation: improvements in methods 
will advance the formulation and testing of substantive theories, and re- 
ciprocally, improvements in substantive theory become available for ad- 
vances in method. The paradox of method can be regarded as parallel to 
Kaplan's (1964:53-54) paradox of conceptualization. "The proper concepts 
are needed to formulate a good theory, but we need a good theory to arrive 
at the proper concepts." This too can be dealt with only by successive 
approximation. 

In exploring the study of social change, it is important to grasp fully one 
of the important implications of the paradox of method: Since tests of 
theories and the development of knowledge depend upon methods, the 
development of substantive knowledge underlying methods is essential. Put 
another way, research on methods is of strategic importance because the 
development of all knowledge depends upon it. The knowledge most centra] 
to the development of any science, then, is the substantive theory and testing 
germane to its methods and measures. 

Some behavioral and social sciences depend rather heavily upon indirect 
modes of observation, 13 such as asking subjects to provide information 
about past behavior. These methods rest on theories about remembering. 
The methods are designed to retrieve information from long-term memory 
by short-term recall probes, but the theory of how subjects get information 
from the past into the present and report it is far from adequately tested. 
We readily recognize that other processes may intervene, such as forgetting 
and deception (including self-deception). 

We cannot understand the methods that rest on these postulates without 
further understanding of memory and motivation. That understanding in 
turn rests, at least in part, upon testing the theory using these methods; 
hence an instance of the paradox. Note that this observation differs from 



1 'This predilection for indirect over direct methods of observation is atypical among the sciences; 
whenever possible, the traditional sciences opt for direct observation. The predilection is not easily 
explained, though some social processes impede direr measurement (Rciss. 1968, 1971, 1976). 
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that underlying the use of multiple methods of measurement. There the 
argument is simply that any particular method taps multiple processes and 
sources of variance (Webb et al. , 1966:4). Here the point is that any method 
is inextricably woven with .ubstantive theories about the behavior under- 
lying the method. Even so, the assumption of multiple confirmation in the 
multimethod approach (Campbell & Fiske, 1959; Webb et al., 1966:5) 
depends upon substantive theories and knowledge: it rests on the assumption 
that components can be weighted according to their known extraneous 
variation and in combination by their independence from the same sources 
of bias. Indeed, one problem of a multimethod approach is to determine 
why methods are not substantially of equal weight. 14 

There seems to be no escape, then, from the paradoxes of theory and 
method. We must be prepared, consequently, to devote considerable effort 
to understanding the substantive theorii ^ that underlie our methods. Indeed, 
it can be argued that the most critical theory for the social sciences is that 
germane to its methods (Reiss, 1980). 

We must therefore draw two more implications of the paradox of method 
for the study of social change. First, concepts and measures used to in- 
vestigate social change are vulnerable to secular changes. Second, the 
methods of measuring and analyzing social change are vulnerable to secular 
changes. The problem for theorists and empirical investigators is how to 
measure social change when both the measures and that which is being 
ireasured are changing. Or correlatively, how to measure social change 
when that which is being measured is changing while the concepts and 
measures are too limited or rigid to detect it. The task seems inordinately 
complex, but must be faced if we are to scientifically understand social 
change. 

Consequences of Institutionalizing Measures of Changes 

The difficulties social scientists encounter in measuring social change 
may preclude the kind of precision we commonly associate with the physical 
sciences. The past may never be kept in such a way that we can tie it to 
the future, once we discover or invent new ways of measuring change. 
Moreover, any evolutionary or dynamic theory of change providing for 
emergence or forecasting faces problems of how to tie the present to the 
past and the future. 



,4 Thc law of evidence and of torts similarly recognizes that all methods are not to be weighted 
equally (Prosser, 1964), though it resolves the matter by legal rather than scientific criteria of 
evidence. 
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Most indicators of change emerge from social processes, and their future 
depends upon such processes. Consequently, social indicators are subject 
to at least two major types of changes. On the one hand, the broad processes 
of change affect the concept and its measures, a matter we shall return to 
below. On the other hand, the organization responsible for data collection 
will, from time to time — in response to discontinuity between the concept, 
its operational measures, and changed conditions — redefine the concept, 
substantively and operationally, or introduce new methods or techniques 
of data collection and measurement. The changes may be perceived as 
occurring within the phenomena under continuous observation or they may 
be accretions to that class of phenomena. 

Examples of how concepts are redefined in accord with changes in the 
observed phenomena are the repeated modifications of the U.S. Bureau of 
the Census in the definition of a dwelling unit and a household. These 
changes respond largely to the way that housing and living arrangements 
change from one decade to the next. The changing definition of "head of 
household" by the bureau is a response to public representations that the 
concept was biased toward older persons and males, and biased in opera- 
tional procedures and conceptualization. Other sex-linked census concepts 
include "secondary family workers" and "housewife in the labor force." 
Yet other seemingly simple census concepts vulnerable to secular change 
include "ethnic status" (vulnerable to changing patterns of marriage), "na- 
tive language," "country of origin," and "ethnic status of parent(s)." 

Examples of increments to a class occur quite commonly for legislated 
concepts. Congress, for example, mandated the collection of information 
on arson as a violent index crime in Uniform Crime Reporting. 

A somewhat extended example may best serve to illustrate how new 
concepts and new measures emerge in response to changed conditions in 
society, and how old concepts may no longer measure the same conditions 
as a consequence of change. 15 The example is drawn from the history of 
measuring employment. 

For much of our industrialized history, 16 a major way of describing the 
economy was to measure the economically active, employed population. 



i5 This example was used in an earlier discussion of this problem in measurement (Rciss, 1982b). 
Robert Parke assisted in developing that example. 

,6 There are equally fascinating questions of measuring employment in preindustrial history and 
its relationship to industrial employment. How does one conceptualize and measure employment, 
for example, in household-based economies or cottage industries? Is the notion of employment so 
closely tied to a form of social organization involving an employer and an employee who derives 
income from that work that it becomes inapplicable to earlier periods of history? In what sense 
can persons in earlier periods be described as "self-employed"? 
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This was conceptualized in terms of a work force comprised of all gainfully 
occupied workers in the nation. Moreover, each worker was considered to 
have a usual occupation, which was asked about as a means of providing 
information on labor resources (Jaffe & Stewart, 1951; Shiskin, 1976). 
Labor resources were regarded as important to jobs, yet only occupational 
status, not employment status, was usually measured. The 1930 Census, 
for example, reported statistics on the gainfully employed work force by 
their usual occupation. There were no national statistics on unemployment, 
because this usual occupational status concept of the work force did not 
include current employment status. 

A conception of unemployment linked tc the business cycle had, of 
course, existed for quite some time. But prior to 1930, interest in unem- 
ployment was largely confined to labor unions. Unions had few resources 
for systematic documentation. Unemployment during the episodic panics 
and depressions of the nineteenth and eariy twentieth centuries led to oc- 
casional estimates of unemployment and, beginning in the 1920s, a growing 
concern with the effects of changing technology led to studies of shifts in 
occupational composition, employment, and unemployment in industrial 
sectors undergoing rapid technological change. There were also estimates 
of seasonal unemployment in agriculture. But all of these estimates were 
based on a presumption that unemployment was a temporary dislocation. 

Consequently, there was surprisingly little statistical information on un- 
employment in the chapter on labor in Recent Social Trends (President's 
Research Committee on Social Trends, 1933:xvi). The national statistical 
system was unprepared to measure it. As Jaffe and Stewart (1951:7) con- 
cluded, although gainfully occupied statistics may have been useful for 
social policy in the nineteenth century, they were of little value during the 
Great Depression. Being collected only once every 10 years, they had 
especially little value in illuminating the immediate problem of mass un- 
employment. Although unreported in Recent Social Trends—perhaps be- 
cause it was only a single observation in time— the 1930 Census was the 
first attempt to measure national unemployment by asking all persons re- 
porting a gainful occupation whether they were at work the preceding day. 
That estimate, limited as it was, was soon considered far too low. Although 
sample surveys were just emerging, government agencies attempted to 
measure unemployment by surveys in 18 cities during the early 1930s and 
nationwide in 1937 in the so-called Biggers survey (Jaffe & Stewart, 1951:9). 

These early attempts to measure unemployment showed the limitation of 
the concept of a work force made up of gainful workers in usual occupations. 
Under this concept, the only people who could be counted as unemployed 
were those "established workers" who did not have jobs. Excluded from 
the unemployed were those without previous employment, such as young 
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people entering the work force for the first time. Housewives, occasional 
workers, and those who earned only supplemental income were J > gen- 
erally omitted. Not regarded as gainful workers, they would not be counted 
as unemployed when they sought but could not find work. The work force 
of the gainfully employed was subject to attrition only through retirement, 
injury, death, and emigration; it was augmented only through the maturation 
of young people and immigration (Bancroft, 1979). Of course, the idea of 
a usual occupation or a customary status such as that of housewife was 
consistent with a stable society based on fixed statuses, not a rapidly chang- 
ing urban economy with mobile factory employment. Customary statuses 
are unlikely to change in the short run; hence, measurement could occur 
at long intervals such as the decennial census. 

The rapid growth of unemployment during the Great Depression and 
public interest in short-run unemployment changes emphasized a need for 
frequent estimates of unemployment and people who wanted — indeed 
needed — jobs. This required a different concept of the relationship of in- 
dividuals to the labor market. The gainful worker concept had emphasized 
occupational status and experience as resources available to fill jobs. The 
emerging concept was a labor force made up of all persons at work, looking 
for work in the preceding month, or temporarily laid off with an expectation 
of being called back to work. This concept emphasized the current activity 
of individuals, and what emerged was monthly measurement. Under work 
force concepts, an increase in the gainfully occupied necessarily meant a 
decrease in the unemployed. Under the labor force concept, the unemployed 
arc measured somewhat separately from the employed. The numbers of 
employed and unemployed can both rise (or both fall) if there arc persons 
entering the labor force from among those not in the labor force, such as 
first entrants, housewives, students, retired persons, and those discharged 
from military service. 17 

Much, then, has changed: the concept of participation in a labor market, 
he ways that employment status and activity are measured, and the fre- 
quency with which measures arc taken. Yet the matter does not end there. 

In 1961, President Kennedy appointed a Committee to Appraise Em- 
ployment and Unemployment Statistics, partly based on the awareness that 
the notion of unemployment had both conceptual and operational difficul- 



1 The consequences of this quasi-independent variation can be a political paradox. In the 1976 
Foid -Carter debate. Carter correctly claimed that the number of unemployed had risen in the most 
recent month and President Ford correctly responded that the number of employed had risen. Four 
years later, the same claims were correctly made in the Carter-Reagan debates with Carter's role 
reversed. (This example is not offered to suggest there invariably is a presidential statistic and a 
candidate statistic on the labor market!) 
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ties. The line between a person looking for work and not looking was 
vague, and the reasons for not looking for work were many. In the end, 
the committee decided to exclude 4 "discouraged" workers, persons who 
had given up looking for work because they saw little chance of getting a 
job. In 1967, the Bureau of Labor Statistics implemented that recommen- 
dation and excluded a substantial segment of the unemployed by restricting 
the concept of looking for work to those searching for work during the 
preceding four weeks. At the same time it began to record persons as 
"discouraged" if they said they were not looking for work but would work 
were there a job for them. 

As recently as 1979, Finnegan advised the National Commission on 
Employment and Unemployment Statistics (1979:215-217) that while the 
practice of reporting discouraged worker statistics — measured every quarter 
rather than monthly — should be continued, this group should not be counted 
as unemployed. But the matter did not end there either. Discouraged workers 
are disproportionally distributed among persons under 20 years and over 
60 years of age and among blacks and women. Should such persons not 
be regarded as in some way unemployed? Clearly, short-run as well as 
long-run changes in the society are at the heart of such conceptual and 
political debates. 

Our example of unemployment highlights a central problem in the mea- 
surement of social change — that the concepts, operational definitions, and 
measures used to chart this change are themselves changing over time as 
a result of social processes. What seems called for is far more inquiry into 
how one adjusts concepts and their measures over time. Splicing of mea- 
sures, synthetic estimation, and multiple measures all provide some means 
for coping with this issue. Yet there is all too seldom provision for adjusting 
statistical series to secular changes. 

Some Consequences of the Organization of Statistical Indicators 

No society could afford the effort to acquire and organize all of the 
statistical information that might usefully enter into decisionmaking. But 
societies vary considerably in the extent to which they institutionalize sta- 
tistical indicators and the uses to which they are put. Japan, for example, 
has a national statistics day; the United States does not. The classical 
bureaucracies of Japan and some other countries such as France may have 
institutionalized a respect for statistics that is less well developed in the 
United States. There is no national statistical system in the United States, 
in the sense that no one really knows what statistics are collected annually. 
Many private organizations collect and con ^ile annual series. Any major 
university has probably several hundred such series. Even individual mem- 
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bcrs of a population keep their own statistics. Indeed, were an inventory 
taken of statistical series, government series would probably contribute the 
minority. 

Our concern in this section is with the consequences of the current 
organization of statistical indicators for monitoring and investigating „ xial 
change. Perhaps the major consequence of the current organization of our 
statistical system is that we cannot readily compile them into meaningful 
aggregates beyond the level at which they are collected. Judicial statistics 
are gathered for every court in the land, but it is difficult to combine more 
than a small part of them into state (much less national) statistical series. 
Conversely, most national statistics are collected in such a way that we 
cannot make local estimates from them. This is especially true for series 
collected by the survey method, but it is so for many other modes of data 
collection and compilation as well. 

A second major consequence is that statistics gathered by private orga- 
nizations genertJy are inaccessible for aggregation or analysis unless re- 
porting is centrally coordinated and controlled. Private organizational data 
are seldom compiled unless there is legislation creating a voluntary or 
mandatory system of reporting. As a consequence, we cannot measure very 
much change using that vast resource. 

A third consequence is that the major developmental and analytical re- 
sources are concentrated in federal statistical systems designed to meet 
particular needs of federal legislative, executive, and judicial agencies. The 
rcculting lack of attention to local variation may have the consequence that 
matters requiring collective attention as well as social changes come to be 
defined for a national aggregate rather than in terms of their local variability. 
To the degree that statistical information is important in reaching decisions, 
this imbalance may bias toward federal rather than state and local adaptation. 
Countries with central statistical systems such as France were historically 
organized to gather information for each department and unit thereof. It 
might be useful to learn more about the role such regionalized statistical 
systems play in social change in contrast with the United States. I would 
draw attention to the problem of developing concepts and measures of social 
change that meet agreed-upon requirements of all levels of government. 
Volunteer sample surveys, rotating panels, and synthetic estimates are some 
of the ways of doing so. This is an area for social invention. 

THE CONCEPTUALIZATION AND MEASUREMENT 



Most current theories and models of social change are deficient in ex- 
planatory precision and predictive power. This section considers some of 
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the ways that contemporary models and concepts could be improved to 
better our understanding of social change. 

Individualistic Biases in Studying Social Change 

The dominant theories of social life in the United States postulate indi- 
viduals as basic units and especially presume individual actor? vho make 
rational decisions. References to collective choice ana organizational actors 
are often translucent, magnifying postulates about the behavior of individ- 
uals to collective actions. Durkbeim's vi v *w of society as a reality sui generis 
is honored more in the breach, as interpretative commentary. 

Ogburn was well aware of the domination of individualism in explaining 
social change, formulating the problem as the role of the "great man" versus 
"social forces" (Ogbum, 1926). His own wc 'mown view was that great 
men are but a medium of social change (1926:231). Historical theories, such 
as those of Sorokin (1937-41, 1943, 1947), likewise assign a key role to 
social forces, which dominate the behavior of individual actors. But the con- 
trary emphasis upon individual actors and individual welfare still dominates 
miK . contemporary theory and research, biasing the treatment of social change. 

Earlier I used unemployment as an e ;ample of a major social indicator. 
Unemployment is a characteristic attached to persons. So is the concept of 
the discouraged worker, based as it is la motivation cf individuals in a labor 
market. Even though it is apparent to labor economists that employment 
attaches to jobs, we lack indicators of job vacancies per se; there is no national 
indicator of jobs comparable to that on unemployed individuals. 

In his ground-breaking studies of vacancy-chaining, Harrison White (1970) 
noted that social scientists had focused on social mobility as individual move- 
ment through jobs, neglecting the fact that a job, as a position in an organi- 
zation, must open up or become vacant to constitute an opportunity for mobility. 
White studied movements of vacancies through bureaucratic hierarchies — how 
vacancies are filled and how the filling of positions relates to organizational 
structure and process. Positions and opportunities ten . t o be linked in chains, 
and these constitute the relative openness to upward mobility in a bureau- 
cracy. 18 Although recent studies of social mobility have examined how or- 
ganizational structures affect occupational mobility, defining these as opportunity 
structures (Posenbaum, 1976, 1984), the study of mobility has not shifted to 
focus, cn changes in opportunity structures themselves. 
To the extent that theories dictate what is problematic, they also dictate 



r ^nthetically we note that the trickle-down market allocation theory of housing in economics 
couic t tested in a vacancy-chaining model. 
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the choice of measures. There is a predilection for developing instruments 
to secure information from and about individuals rather than from and about 
organizations, and our concepts and measures more often refer to individual 
statuses than organizational positions. 

The individualistic bias certainly predominates in methods development. 
The sample survey is built primarily around sampling individuals and is 
poorly developed to secure information about organizations. Indeed, most 
methods for collecting information on organizations actually collect infor- 
mation about individuals in organizations or rely on individual surrogates 
for the organization. Far too little attention is paid to measures other than 
surveys (Sinaiko & Broedling, 1976). 

Even when organizations are the object of study, the line of discussion 
is affected by the individualistic bias. For example, Kish (1965) noted that 
around the time of the first Sputnik, about half of U.S. high schools offered 
no physics; a quarter, no chemistry; and a quarter, no geometry. He then 
noted that this did not tell us how many high school students could take 
courses in chemistry, physics, or geometry, for the schools offering no 
such courses, though large in number, were snail in size, accounting for 
only 2 percent of all high school students. 

It is clear that average school chaiacteristics would give a misleading 
description of conditions for the average student. But Kish failed to point 
out the relevance of the original organizational statistic for deploying col- 
lective resources. For some purposes the distribution of schools is critical, 
e.g.. if a government nought to equalize educational opportunities for all 
students. To do that, the government would have to merge schools or divide 
resources among existing schools; and in either case, a iarge number of 
schools would be involved. Fifty percent of all high schools, for example, 
might need a new physics laboratory and an instructor trained to offer 
physics. As a consequence, the market for physics teachers might change 
drastically and organizational consequences on teacher recruitment and training 
would be considerable. One can readily imagine a whole train of organi- 
zational, structural, and individual consequences stemming from how one 
reads these statistics and decides to act on them. 

We routinely conceptualize and measure the size and composition of 
populations of individuals but have only recently come to think seriously 
about doing so for populations of organizations. 19 Yet the size of the or- 
ganizational population in the United States is greater than that of hidivid- 



,9 Nctworks arc even more complex. Consider that apart from unmarried siblings, the kinship 
network is never the same fc any two individuals For further discussion of research on organi- 
zations, see Hannan, in this volume. 



70 



MEASURING SOCIAL CHANGE 



61 



uals. Consider that a household is a form of organization, that it is not co- 
terminous with the family is a form of organization, and that these may 
be regarded as two distinct populations of organizations. Alexis de Tocque- 
ville sensed the multiplicity of organizations when he characterized America 
as a society of joiners; yet he focused primarily on the individual charac- 
teristics of the joiners and less so on the fact that American society was 
creating an enormous number of organizations for individuals to join, many 
such organizations persisting well beyond the involvement or the life of 
those who founded them. 

The suggestion here is that organizations may play as great, if not a 
greater, role in social change than individuals, and that the bias toward 
individualism fails to take into account how populations of organizations 
are both causes and effects of such change. This, indeed, is not the only 
consequence of the individual bias; other units are neglected as well, such 
as units of culture. It has perhaps seemed simpler to count individuals when 
faced with the difficulties of devising and counting organizational popu- 
lations or cultural products. 

Ogburn held that change in material culture — invention — is fundamental 
to social change. One test of his theory required counting the numbers of 
inventions so that one could calculate the rate of growth of the technological 
base. Although he used patents to count the growth of invention, he rec- 
ognized the limitations of this indicator. It perhaps is unfortunate that he 
did so litde to try to count social inventions. 

Ogburn' s neglect of how one conceptualized and counted social inven- 
tions should be seen in Historical perspective. Societal intelligence on the 
growth of science and technology is little advanced over Ogburn's day. At 
the core of counting inventions are conceptual problems of what constitutes 
invention and of how one measures the growth of knowledge in the sciences. 

Some 30 years ago Lazarsfeld and Barton (1951) wrote a piece on mea- 
surement in the social sciences for a volume edik.1 by Lerner and Lasswell 
on the policy sciences. They drew attention to the fact that while the 
individual was a primary unit of observation and measurement, there are 
also units that are not based upon the primary characteristics of individual 
members precisely because no individual data correspond to them. A com- 
munity can have a speed law; individuals cannot. A community can be 
characterized in terms of the proportion of its members who violate those 
laws. A primary characteristic of a unit, they noted, had to be distinguished 
from its analytic characteristics, which refer to component elements. Or- 
ganizations may be seen in terms of their individual members or in terms 
of properties that cannot attach to individuals. 

Although individual analytic characteristics may help explain social change, 
as, for example, the proportion of a society's manpower that is employed 
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in science, individual scientists do not and cannot have a technological 
base. The neglect of the latter for the former data may account for our 
being in a disadvantageous position to theorize on and measure social 
change. Even in our use of cohorts — consider the examples used earlier in 
this paper — we usually look at cohorts of individuals, rarely at cohorts of 
organizations. Bankruptcy is generally expressed in annual rates rather than 
survival rates in a birth cohort of organizations. 

The study of social change should focus, then, much more on the primary 
characteristics of organizations, which should be regarded more in terms 
of functional sub units than participating individuals. We can readily see 
that resources (such as laboratories) and relational properties (such as hi- 
erarchy) are primarily characteristics of organizations, not individuals. We 
must systematically collect better information on characteristics of orga- 
nizations and units of material and nonmaterial culture, to use Ogbum's 
terms, if we are to understand cultural and social change. 

Individual versus Collective Welfare There is a bias in welfare models 
of human behavior toward optimizing or maximizing individual welfare 
rather than the welfare of collectivities such as organizations. 20 Trade-offs 
commonly are seen in terms of individual rather than collective costs and 
benefits. The quality of life is measured in terms of individual rather than 
collective units: Is this community a good one for scientists rather than for 
science? Is the housing stock fit for individual habitation rather than what 
kind of collective life is possible, given the housing stock? How ore asks 
the question can make a difference. We look at the crime rates of com- 
munities in terms of victimizations in a population of individuals, neglecting 
the high rate of victimization of organizations and collective property — 
parks, schools, playgrounds. We look at individual careers in crime rather 
than at community careers in crime (Reiss, 1982a, 1983); yet the latter 
may explain much of the former. 

Concepts such as justice, social cohesion, and social integration are not 
reducible to the lives of a society " individual members, nor can they be 
measured simply by summing observations for individuals. Changes in 
particular social indicators can hive collective and individual effects. A 
change in the divorce rate, for example, is both a change in the status of 



^Measures of social welfare as conventionally defined in political economy should not be 
confused with measures of collective welfare. Measures of social welfare typically are based on 
the concept if a collective consensus based on individual preference scores. Although such pref- 
erence measures may be technically infcasible, they presume that consensus measures on welfare 
preferences opi'mize or maximize collective welfare, which is an empirical matter. 
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individuals and a change in social relationships and organizational structure 
of society. Most divorces increase the number of single-person households 
and decrease the number of two-or-more-person households. Divorces alter 
the relationships of husband to wife, child-en to parents, insurers to insured, 
and the taxable income and legal status f the parties, to mention only a 
few consequences of changes in the divorce rate. Such changes may produce 
chain reactions. The divorce rate can have a substantial effect on the size 
and occupancy rate of the housing stock, which may affect the burglary 
rate (a crime against housing units rather than individuals per se). 

Understanding social change would thus seem to require understanding 
of collective as well as individual welfare, and how changes in collective 
welfare are consequential for individual welfare. We may need to think 
more about the well-being of science in society, less about the consequences 
of science for the quality of individual life. Controversies over the risks of 
science must be viewed not only in terms of the risk to individuals, such 
as by gene splicing, but also of how the failure to do gene splicing research 
may affect the st* *e of science in a competitive order of societies. 



Lags in Measuring Social Change 

Ogburn edited an annual series of the May issue of The American Journal 
of Sociology called Social Changes in [Year] from 1928 to 1935 and in 
May 1934, one entitled "Social Change and the New Deal." In the early 
volumes, Ogburn made clear that his purpose was not that of editing a 
conventional yearbook but rather 4 'scientific analyses of social change ..." 
(1929) The Great Depression, with its marked ->cial changes, had con- 
sequences for the publication of his annual series. In his introduction to 
Social Change in 1932 (1933:823-824) he observed: 

The American Journal of Sociology has itself been influenced by these economic changes, 
and a policy of retrenchment in the interests of economy has affected the size of this 
special issue. We have had to reduce the number of the articles, as it did not seem 
possible to reduce the length of the articles further and have /icm of any scientific 
merit. In order to do this, some of the topics covered regularly in the annual "Social 
Change" issue have been omitted. ... In some cases the omission of certain topics is 
not a particularly serious loss because extensive data arc not always collected every 
year in sufficient volume to note significant changes, and a two year interval will show 
the changes most clearly. This is true, for instance, in the case of social legislation. 
Most of our state legislatures meet only once in tv o years. 

The effect that changes in society can have upon its intelligence system is 
disclosed here rather dramatically. 
A second issue is also evident here. With what frequency shall we collect 
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measures of social change? Ogburn calls our attention to the fact that 
frequency of measurement is in part tied to the social processes themselves. 
Changes of some kinds — especially those he would have characterized as 
adaptive — are institutionalized, such as in the periodic meeting of legis- 
latures. The response to change will determine in part the scale and fre- 
quency of measurement and thus the capacity of science to detect and 
measure such changes. 

A third point was also mentioned briefly in Ogbunf s introduction — the 
problem of lags in our intelligence on social change. He notes in particular 
the lag between an event, its measurement, and analytical understanding 
of it (1933:823-824): 

There are few aspects of our social life that have not been markedly affected by this 
most severe economic depression of modem times. The papers in this volume indicate 
many of these changes and their effects. The extremely dramatic events, which began 
in the latter part of February and reached a climax in the most extensive closing of the 
banks ever known, have particularly significant effects. These, however, are not re- 
corded in this volume, which is restricted to 1932. Some time has to elapse after an 
event for the data to be collected and recorded so that it is possible to submit them to 
scientific analyses. News events are almost simultaneous, but there must be a lag before 
the scientific analyses can occur. 

Here we see a major and continuing issue in conceptualizing, measuring, 
and monitoring social change — that of how our intelligence systems can 
be developed to collect information on events as they take place and how 
we can reduce the lag between collection of information and scientific 
analysis. We often fail to collect information rapidly that is essential to 
scientific analysis while, at the same time, far more information lies in our 
collection systems than we can process. How to resolve these problems is 
not altogether clear. A good theory helps, but data collection also depends 
upon social processes. 

Forecasting and testing likewise depend upon these processes. Since 
Ogburn's day, great strides have been made in time-series analysis by 
developing forecasting models and identifying causal and "leading indi- 
cator" models. Despite the inevitability of some lags in analysis and model 
testing, more attention still needs to be given to short-run forecasting as 
well as long-run theories of social change. Our capacity to measure and 
monitor social change depends upon using and encouraging all processes 
that represent investments in knowing about and understanding social change. 
Economics in particular has both a macro theory of change and methods 
of short-run forecasting. The theory develops episodically and partly as a 
consequence of social change; such concepts as stagflation emerge to cope 
with the inadequacies of the theory and to fit it more closely to a world 
"out there." Economics may have progressed rapidly precisely because it 
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forecasts. Forecasts that fail are crucial steps in learning about theories and 
discovering where their weaknesses lie. But that means we must build 
models of social life so that it can be forecast. Both demography and 
economics have done so and learned much by failed forecasts; other be- 
havioral and social sciences might well take note. At this juncture of theory 
building on social change, social change itself is the way to theory building. 
Thus, theory testing and failed forecasts may be the best paths to scientific 
understanding of social change. 



Need for a National Statistical System 

Although one can easily demonstrate that our national indicators of eco- 
nomic and social change are more highly developed in some areas than was 
true on publication of Recent Social Trends, in many areas there has been 
little improvement. For example, we still have few national indicators of 
legal change, aad we rely almost exclusively on ad hoc surveys to monitor 
changes in values and value practices such as religious belief and obser- 
vance. This discontinuity and variability in indicator development, collec- 
tion, and reporting amounts to a failure to develop the national statistical 
system Ogburn en iSioned. Some of this is due to benign neglect by the 
social sciences since World War II of macro social change in advanced 
societies. 21 One of the requirements for developing and testing theories of 
social change is a set of concepts and their indicators measured over time, 
within the domain of a national statistical system. 

I note three salient conclusions about requirements for a national statistical 
system: 

First, we require research devoted to building explanatory models of 
social change in order to structure a national statistical system that can 
usefully measure and monitor this change. 

Second, because of the limits of present model? of social change and 
underinvestment in their development and testing, we generally lack data 
on potential explanatory variables for the trends that are monitored and 



1 World War n appears to have been a historical dividing point in the study of social change. 
In the postwar period, the fashion in studying social change shifted to the "third world/* the 
"developing nations," and "economic and cultural development." Modeling efforts shifted to 
how one might simulate the growth of economies and, increasingly for the non^conomic sciences, 
to the effects of rapid social change on traditional cultures. Although this latter interest fell within 
the domain of Ogburn's lag formulation, its shortcomings (see Smelser. in this volume) failed to 
generate interest in revising the model 
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measured; the major exception occurs for selected models of the economy. 22 
Generally, when explanatory variables are collected, their analysis is ad- 
ministratively segregated from analysis of the trends as such, substantially 
reducing the ability to test theoretical models. There are substantial problems 
in linking and analyzing extant information for like units ( T -na & Hannan, 
1979a, 1979b) and myriad problems stemming from lack ot standardization. 
As a result, census variables such as age or years of school completed have 
lo become surrogates for almost every conceivable explanatory concept. 

Third, we lack an adequate system of indicators of science and tech- 
nology. 1. j annual report of the National Science Board, Science Indicators 
(1983) has almost no major indicators to monitor substantive changes in 
science and technology, much less a set of explanatory variables related to 
such changes. Most glaring is the absence of indicators for the behavioral 
and social sciences and technologies based on them. The failure to collect 
information on the content of science and technology, especially on inven- 
tions, has two important consequences. One is that we do not measure 
changes in the rate of behavioral and social science inventions and tech- 
nology and their contribution to contemporary society. The other, and a^re 
important, is that we are unable to measure relative contributions to sociai 
change, especially the contributions of behavioral and social science and 
technology compared to that of physical science and technology. 



The decades since the publication of Recent Social Trends ha e been a 
period largely of benign neglect by the behavioral and social fences in 
modeling and measuring social change, economics being the major excep- 
tion. This neglect may owe in part to the reticence of theorists, save for 
economists, to address matters of social change. But scientific knowledge 
shapes and is shaped by such change; it becomes practically meaningful in 
the context of what kind of change is* or seems, possible; and it is tested 
against these consequences. It may be well to remember that Ogburn, ; - 
Duncan (1964:vii) calls to our attention, "saw science primarily as an 
accumulation of knowledge, but an accumulation whose structure is subject 
to continual change as new relationships among its parts are perceived or 
as discoveries shed new light on supposed relationships/* 

PerhtyS the period of benign neglect is drawing to a close, in which 



In acknowledging this, attention is also drau.ii to the dissatisfaction vuth the precision of 
measures of variables estimated in structural cquatio i models of the cl ononis i.r of leading in- 
dicators (sec Klein, in this volume). 
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event it is essential to attend to the kinds of problems touched upon in this 
essay. We must understand better the role of behavioral and social science 
knowledge and inventions in social change. We must examine the effects 
of social change on theory, concepts, and measures, including their capacity 
to record and render social change intelligible. Time has favored Ogburn's 
conviction that statistical intelligence systems have a critical role to play 
in the processes of science and in society as a whole. 

* * * 

I wish to thank Otis Dudley Duncan and Barbara Laslett for their helpful 
substantive comments. 
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Uncertainty, Diversity, 
and Organizational Change 



MICHAEL T. HANNAN 



There is in our social organizations an institutional inertia. . . . Uniess there 
is a speeding »»p of social invention or a slowing down of mechanical invention, 
grave maladjustments are certain to result. (President's Research Committee 
on Social Trends, I933:xxvii.) 

How difficult is it to reshape complex organizations when conditions 
change? Ogburn's (1933) work on technical innovation and society built 
on the premise, illustrated in the quote above, that organizations and social 
institutions strongly resist change. h w argued that the combination of rapid 
technical innovation and organizational inertia disturbs equilibria. Long 
periods of disequilibrium caused by lags in adjustment of social structures 
to hanging material conditions can have high social costs, as Ogburn and 
his collaborators insisted 50 years ago. 

Despite the seeming ubiquity of organizational inertia in everyday life, 
the social science literature has sometimes painted a very different picture. 
Both organizational theorists and specialists in management have often 
described a world in which organizational adjustment to changing external 
conditions is almost friction-free. March's < ^81:563) review of the liter- 
ature on organizational change notes this uon inant theme: 

Organizations are continually changing, routinely, easily, and responsively, but change 
within organizations cannot be arbitrarily controlled. . . . What most reports on im- 
plementation indicate ... is not that organizations are rigid and inflexible, but that 
they are impressively imaginat' c. 

Which is it? Are organizations subject to strong inertial pressures as 
Ogburn has it? Or do they change easily and routinely as March claims? 
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This disagreement raises fundamental issues about the relation of organi- 
zations and society, issues that have important theoretical and practical 
implications. 

If change in organizational strategies and structures is rapid and smooth, 
it is reasonable to respond to changing conditions by continually iedesignmg 
existing organizations. But if organizations typically respond slowly or not 
at all to changing opportunities and threats in their environments, it may 
make more sense to continually replenish the stock of organizations. These 
alternative strategies imply vastly different social policies. 

The disagreement between Ogburn and March reflects more than a gen- 
erational shift in organizations theory and research. Contemporary opinion 
among organizational researchers splits sharply on these issues. Questions 
of organizational inertia are fundamental to understanding organizational 
structure and change; thus the opposing opinions voiced by Ogburn and 
March, which continue to divide current researchers, provide a useful frame- 
work for considering past and present theory and research on organizational 
change. 

inertia is only one of several noteworthy factors affecting the adaptability 
of organizations to environmental uncertainty. At least as important is the 
diversity of organizational forms in society — the stock of organized solu- 
tions to problems of producing collective action in variable settings. Trends 
that eliminate organizational diversity lower the capacity of social systems 
to deal with uncertain environmental change. 

Questions about diversity and inertia are especially important when 
change in technical, social, and political environments is uncertain. If 
environments are highly stable (and thus certain), there is really no 
continuing problem of organizational adaptation. It will become clear 
eventually which forms of organizations are well suited to the stable, 
prevailing conditions (either by differential selection or by learning and 
imitation). Likewise, if environments change in predictable ways (for 
example, seasonal changes in demand for energy, Christmas trees, and 
other commodities), even highly inflexible organizations can schedule 
adjustments far enough in advance to match strategy and structures to 
these changing states. 

Issues of organizational inertia and organizational diversity are im- 
portant to understanding modern social change. This essay describes the 
development of theory and research on organizational processes as it 
bears on these questions. It also suggests new lines of inquiry that might 
better clarify the relations between organizational change and large-scale 
social change. In particular, it discusses recent theory and research that 
consider organizational diversity and change from ecological and evo- 
lutionary perspectives. 



UNCERTAINTY, DIVERSITY, AND ORGANIZATIONAL CHANGE 



CENTRALITY OF ORGANIZATIONAL PROCESSES 
IN LARGE-SCALE SOCIAL CHANGE 

Most theories in the social sciences emphasize the actions of autonomous 
individuals, interest groups, social classes, and institutions rather than those 
of concrete organizations. But almost all modern collective action takes 
place in organizational contexts; organizations are the main actors in modern 
society (Coleman, 1982). When interest groups and social classes take 
collective action, they do so using specific organizational tools such as 
labor unions, political parties, or terrorist groups. Recent research shows 
that even relatively amorphous social protest movements have a higher 
likelihood of success if they can use existing organizations (Tilly, 1978). 
The state, which has become the focus of so much social action, is ibelf 
an organization (or perhaps a hierarchy of organizations). Struggles for 
power and control in modern societies typically involve struggles between 
competing organizations for privileged positions in the state structure as 
well as struggles between the state and other kinds of organizations. 

Organizations are also important in modern societies because of the role 
they play in creating, promulgating, and enforcing social norms. The cod- 
ification of norms as explicit, legitimated organizational rules gives such 
rules great force. Organizations typically develop formalized roles and 
procedures for enforcing these rules. Employment contracts, for example, 
have more continuous and binding effects when labor unions monitor com- 
pliance and take action to enforce them. 

Because organizations are key actors in modern society, the speed and 
direction of large-scale social change are constrained by organizational 
dynamics. In particular, the responsiveness of society u charging condi- 
tions depends on the inertia of its constituent organizations and on the 
diversity of its stock of organizations. 

The problem of matching outputs of schools to the needs of a changing 
economy illustrates the "roblem. It has long been evident that American 
school systems were failing to t^ach enough mathematics and science to 
all but the richest and most able students. Over the past several years a 
series of national commissions has identified this situation as a 44 national 
problem" and urged immediate and far-ranging reforms of U.S. public 
education. Some commissions urge more attention to teaching (and re- 
quiring) more mathematics, science, and computing; others emphasize at- 
tention to writing. All agree that the quality of teachers needs to be upgraded 
and that more time must be allocated to teaching. A broad consensus seems 
to have emerged on the definition of the problem; federal and state officials, 
school district officials, legislators, school employee unions, and parents' 
groups all urgs reform. 
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How quickly can the national system of public education be reformed? 
Despite the fact that many states have imposed new rules and constraints 
on school systems, t? ere are a number of reason*, for suspecting that change 
in the actual organization of sci K)ling will be halting at best. The demo- 
graphic and institutional constraints on change in this system are very 
powerful. Consider the problem of upgrading the technical knowledge of 
teaching staff* Tn a period of declining enrollments, school staffs have 
been shrinking (although there has recently been an upsurge in demand for 
science teachers). Given the "last-in, first-out" policies favored by bur- 
eaucracies and demanded by teachers' unions, change in the composition 
of teaching staffs will be glacially slow without some radical alteration of 
employment policies. Any such radical change is sure to encounter stiff 
resistance from unions, as well as legal challenges. A radical change in 
policy may also mobilize previously quiescent grcups. 

The complexity of the organizational networks involved compounds the 
adjustment problem. There is no unitary chain of command; rather there 
are multiple, partially overlapping jurisdictions of local, state, and r ederal 
agencies, with no central planning mechanism Change in any one sector 
is hampered by overlaps with others. For example, the seemingly simple 
problem of changing textbooks in public school systems is made very 
complicated by the organizational arrangements. Many different organi- 
zations and individuals must be consulted; any one of them can forestall 
the change. 

Implementing even a broad and powerful mandate to change the edu- 
cational system means changing many organizations and their interlocking 
connections. The whole system responds only as fast as the slowest com- 
ponent organizations. 

Similar issues arise in industry, although the processes are different. In 
recent years, a number of highly concentrated American industries such as 
steel, automobiles, and agricultural and construction machinery, hav2 stum- 
bled before more efficient foreign competitors. The giant American firms 
in these industries adapted their strategies and structures to earlier technical 
and social conditions, and *hey have been ponderous at best in responding 
to new challenges. Firms in these industries have relied on political muscle 
to obtain favorable government intervention to limit competition, as we 
have seen in the auto and steel industries. Success in this tactic serves 
mainly to further delay radical change in industrial strategies and structures. 

Global national policies like "reindustriaiization" imply massive change 
in the structures of thousands of organizations. Whether such policies can 
proceed quickly enough to meet international competition and lapidly changing 
technologies depends largely on the responsiveness of existing firm* in the 
economy and on the rate at which new firms cin be created and brought 
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up to speed. Analysis of such policies requires knowledge of the dynamics 
of organizational populations. 

The discussion to this point has considered organizations as passive brakes 
on social changes initiated elsewhere in the society or in the environment. 
But the image of organizations as passive is seriously misleading. Of course, 
organizations are constructed as tools for specific kinds of collective action. 
For example, agents invest resources in hospitals or armies in the hope of 
achieving specific kinds of performances. But one of the main contributions 
of organization theory and research has been to show that organizations are 
far more than simple tools. 

Organizations consume great quantities of resources in merely maintain- 
ing their structures. Because great quantities of resources are used tor 
organization building and for bureaucratic or administrative overhead rather 
than for production or for collective action, organizational politics often 
revolve around issues of resource allocation (Cyert and March, 196 1 ; Pfef- 
fer, 1981). Organisational politics makes problematic the relation between 
technical needs for production and actual distribution of resources. Subunits 
strive to protect and expand budgets and staff sizes. The resulting com- 
petition for fixed resources is especially severe in times of contraction or 
decline (Freeman and Hannan, 1975; Hannan and Freeman, 1978). Because 
allocations within organizations are subject to intense political contest, 
organizational action depends on the dynamics of political coalitions. Or- 
ganizational politics often makes collective action deviate from ostensible 
goals, from the demands of relevant environments, and from the intentions 
of organizational leaders. 

For these various reasons attempts at understanding patterns of large- 
scale change in modern societies (or relations between public policy and 
actual implementation) require detailed attention to organizational processes 
and dynamics. 

Rational and Natural System Perspectives 

Systematic organizational theory began when bureaucratic forms gained 
ascendency as ways of organizing the activities of the state and of large 
industrial concerns. German sociologist Max Weber, the founder of socio- 
logical organizational theory, emphasized the importance of the spread of 
bureaucracy to the spread of norms ot itionality. Bureaucracy, which is 
built on formalized rules, explicit spheres of competence, and full-time 
professional staff, permits rapid, efficient, and calculable response to ad- 
ministrative directives. In Weber's (1978:973) view, 

The decisive reason for the advance of bureaucratic organization has always been purely 
technical superiority over any other form of organization. The fuiiy developed bureau- 
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cratic iLechanism con pares with other organizations exactly as does the machine with 
the non-mechanical nudes of production. 

Because it is precise and efficient and because it can (in principle) serve 
the interests of any who come to control it, bureaucracy is practically 
indestructible in Weber's view. 

Weber's insistence on the machinelike character of modern bureaucratic 
forms was echoed in this country by Frederick Taylor, the founder of the 
school of organizational design called Scientific Management (see Perrow, 
1979, for a detailed examination of this school). Taylor described smoothly 
functioning organizations in which all tasks were broken down into minute 
component according to the logic of "time and motion strdies." Research 
in this tradition sought to learn optimal designs for such organizational 
machines. Much work in this tradition, for example, tried to discover the 
optimal "span of control" for industrial organizations, the ratio of super- 
visors to workers that maximizes efficiency. 

Much subsequent work in sociology, industrial psychology, industrial en- 
gineering, and economic* was shaped by broad assumption that organi- 
zations are efficient, impersonal tools for production, administration, and other 
forms of collective action. Scott (1981) provides a careful summary and critique 
of work in this "rational-systems" perspective. This approach has produced 
much useful knowledge, especially about the conditions under which formal 
organizations have efficiency advantages in coordinating complex work. Many 
empirical findings of this tradition have become the conventional wisdom of 
rnanagement and public administration theory. 

This perspective also continues to guide much current research. For 
example, an important development in economic theory of organizations 
argues that organizations are often able to minimize the costs of completing 
economic transactions when markets fail due to imperfect information, 
cognitive limitations on the ability to process information, and opportunism 
(Arrow, 1974; Williamson 1975). 

Although the rational-systems perspective continues to shape research on 
organizations, most sociological research has long made an opposing ar- 
gument. As early as 1915 German sociologist Robert Michels, who agreed 
with Weber that bureaucratic forms were indir disable for efficient col- 
lective action, argued that bureaucracies seldom pursue their ostensible 
goals. Ho clain"^d that organizations are subject to an "iron law of oli- 
garchy." An organization requires expert leadership even when it is de- 
signed for democratic and collective ends, as in the case of labor unions 
and political parties. As leaders learn skills of managing and become dif- 
ferentiated in prestige and lifestyle from the mass membership, they develop 
interests in preserving the organization (and their privileged position) at 
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any cost. They also develop the capacity to control organizational decisions. 
Thus, Michels argued, leaders typically can and do subvert the goals of the 
organization to minimize the risk that the organization will be destroyed. 
According to Michels (1962:364-365), 

. . . the principal cause of oligarchy in denweratic parties is to be found in the technical 
"^dispensability of leadership. . . . Reduced to its most concise expression, the fun- 
damental sociological law of political parties (the term "political" here being »ised in 
its most comprehensive significance) [is] 4< It is organization which gives birth to the 
domination of the elected over the electors, of 'die mandatories over the mandators, of 
the delegates over the dekgators. Who says organizations, says oligarchy." 

Micnels described a process by which an organizational tool takes on a life 
of its own. One result is that organizational action becomes highly unpre- 
dictable from knowledge of public goals and interests of its numbers. This 
insight has been amplified by numerous studies in the so-called "natural- 
systems" perspective (Scott, 1981), which stresses the continuities between 
formal organizations and communities (Parsons, I960; Selznkk, 1948). like 
comrriinities, organizations have rich and complex political systems, and 
organizational action is often the outcome of political coir^sts among factions. 
Suburbs of organizations seek to defend self-interests and resist reallocations 
of resources when conditions change. Moreover, members often develop shared 
norms in opposition to management For these reasons organizations are at 
best "recalcitrant tools," as Selznick (1948) put it 

Much early work in the natural-systems perspective involved close ex- 
aminati i of tl/e actuai process of work in organizations, as in the famous 
studies from the Hawthorne experiment at the Western Electric works 
(Roethlisberger and Dickson, 1939). Also important were the case studies 
by students of Robert Merton at Columbia such as Peter Blau's (1955) 
analysis of patterns of exchange in a social work agency and Philip Selz- 
nick's (1949) study of the relations between the Tennessee Valley Authority 
and its local community. Recently, organizational sociologists have ex- 
tended this perspective by conducting comparative quantitative analysis of 
organizational politics. One particularly useful line of work, which follows 
the lead of the so-called Carnegie School (especially Cyert and March, 
1963), explores how control over essential resources converts to power 
within organizations and how power balances shape strategy and structure 
(see especially Pfeffer, 1981; Pfeffer and Salancik, 1978). 

As in the Weberian tradition, the natural-systems perspective has pro- 
vided detailed empirical information about the limitations of organizational 
solutions to problems of collective action. It has identified the processes 
that distinguish organizations from machines and shifted attention away 
from idealized images of organizations and toward recurrent patterns of real 
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organizational action. Nun^erous findings from this research tradition have 
also become enshrined in the conventional wisdom of management. 

Theory and research on organizations during the past 20 years have sought 
increasingly to synthesize elements of the two starkly opposed perspectives. 
This work retains the premise of rational-systems theory that organizations 
are created as tools for collective action and that, in the long run at least, 
performance matters. That is, this synthetic perspective takes issue with 
the implicit claim of the natural-systems perspective that organizations are 
somehow shielded from negative consequences of inferior performance. It 
also rejects the naive claims of the rational-system perspective that orga- 
nizations are simple, calculable machines. Instead it tttats organizations as 
open systems that depend on a continuing flow of resr/iirces from environ- 
ments. The necessity to maintain such a flow exerts at least some discipline 
on organizations. However, the fact that one essential resource — member- 
ship — comes with special interests and with attachments to other parts of 
the social world creates conditions of recalcitrance and inertia. According 
to various open-system perspectives, organizations are subject both to en- 
vironmental constraint and to strong inertia. The main theoretical problems 
concern the relation of these two kinds of constraints. 

Theee issues are most interesting theoretically and most relevant to practical 
problems when they are considered in the context of organizational change. 
tVspite die fact that inertial tendencies seem to be strong, especially for old 
una large organizations, the world of organizations has changed markedly over 
time. Organic ' **al forms that dominate today differ dramatically from those 
that held sway a century ago. Chandler (1977) gives a vivid account of the 
changes in organizational ft» ns in industry over this period. Similar changes 
can be found in the structures of labor unions, medical care organizations, 
and government agencies. Thus, changes in social, economic, and political 
systems apparently do affect organizational structures and practices. 

The major gaps in our understanding of organizational change concern 
the actual dynamics — exactly how does change in larger systems affect the 
distribution of organizational forms in society? In particular, how much of 
the change in the organizational world comes about through tinkering (adapt- 
ing organizational strategies and structures) and how much through replace- 
ment of one kind of organization by another? We are just beginning to learn 
about the relative rates of the various processes, which are crucial to an- 
swering this question. 

Perspectives on Organizational Change 

The contemporary literature contains at least three broad perspectives on 
organizational change. They all emphasize that uncertainty is an inescapable 
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problem for organizations and plays the key role in shaping their structure 
and action. 

TTie most widespread view, rational-adaptation theory, argues that or- 
ganizational structures are consciously chosen solutions to certain environ- 
mental problems. It suggests that the observed variability in the world of 
organizations reflects planned changes of strategy and structure in response 
to environmental uncertainties, threats, and opportunities. As a theory of 
change, this perspective holds that organizations identify threats and op- 
portunities and reshape structures to mitigate threats and exploit opportu- 
nities. This approach is mainly directed at explaining the success of large 
and powerful organizations, those that have managed to adapt well to 
changing environmental demands. 

There are numerous variants of this approach, which differ widely in 
some wa>s. Contingency theories stress the need for organizations to design 
structures feat buffer their production activities (the so-called technical core) 
from uncertain environmental variations (Lawrence and Lorsch, 1967; 
Thompson, 1967). Thus, optimal organizational design is contingent on the 
nature of the production process and of environmental variations. When 
either production processes or the pattern of environmental changes shift, 
organizations attempt to alter their structures, according to this view. In a 
similar vein, resource-dependence theory argues that organizations must 
take action to eliminate sources of uncertainty in the environment (Pfeffer 
and Salancik, 1978). When sources of uncertainty change, organizations 
are forced to alter their strategies and structures to resolve new threats to 
their resource flows. 

An institutional approach, discussed ai greater length below, holds that 
organizational structures are rationally adapted to environmental demands, 
but that the key demands are often normative and symbolic (DiMaggio and 
Powell, 1983; Meyer and Scott, 1983). In Lhis view, organizations dem- 
onstrate their competence within spheres of action and maintain flows of 
essential resources by displaying appropriate symbols. Such symbolism is 
often coded in structures. For example, firms ^splay their commitment to 
planning by creating planning committees or boards of directors and by 
creating planning departments. What these units actually do is much less 
important than their mere existence, according to current institutional the- 
ories. Moreover, as fad* and fashions in organizational designs change, 
organizations are expecteu to reshape their structures accordingly. As in 
die cases of contingency theory and resource-dependence theory, the var- 
iability of structures in the world of organizations is assumed to reflect 
planned adaptations to changing environmental demands. 

A second perspective, random-transformation theory, claims that orga- 
nizations change their st -rctures mainly in response to internal politics and 
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other endogenous processes, especially the search for solutions to problems 
of uncertainty. Because there is much randomness in the character of the 
search, such changes are only loosely coupled with the desires of organi- 
zational leaders and with the demands and threats of environments (March, 
i981; March and Olsen, 1976; Weick, 1976). 

The third perspective, ecological-evolutionary theory, holds that most of 
the variability in organizational structures comes about through the creation 
of new organizations and organizational forms and the replacement of old 
ones (Aldrich, 1979; Carroll, 1984; Freeman, 1982; Hannan and Freeman, 
1977; McKelvey, 1982; Nelson and Winter, 1982; Stinchcombe, 1965). 

These three perspectives disagree on the sources of organizational di- 
versity. According to rational-ad' station theory, the diversity of organi- 
zational forms in society reflects the diversity of environmental problems 
that must be solved. If the environment becomes more differentiated, di- 
versity will increase; if it becomes less differentiated, diversity will decline. 
The random-transformation perspective suggests that diversity reflects mainly 
the peculiar local and random character of problem solving in each orga- 
nization. Finally, the ecological-evolutionary perspective states that diver- 
sity uepends on the arrival rate of new organizations and on their diversity, 
on patterns of environmental variation, and on competitive dynamics within 
organizational populations and communities. 

Progress in explaining organizational diversity and change requires un- 
derstanding both the nature of organizational change and the degree to 
wh ; ch it can be planned and controlled. The remainder of this essay con- 
centrates mainly on the first issu : does most of the observed diversity in 
organizational features reflect changes in existing organizations, whether 
planned or not, or does it reflect changes in populations with relatively inert 
organizations replacing one another? In other words, does change in major 
features of organizations over time reflect mainly adaptation or selection 
and replacement? 

An Ecological-Evolutionary Approach 

If organizations are subject to strong inertial pressures and face change- 
able, uncertain environments, there are strong parallels between change in 
organizational populations and change in biotic populations. In this case it 
may be useful to analyze selection and replacement in populations of or- 
ganizations. As I try to illustrate below, this shift in focus has opened new 
and interesting questions. 

A population perspective concentrates on the sources of variability and 
homogeneity of organizational forms. It considers the rise of new organi- 
zational forms and the demise of existing ones. In doing so, it pays con- 
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siderable attention to population dynamics, especially the processes of 
competition among diverse organisations for limited markets. 

All accepted theories of biotic evolution share the assumption that in- 
novation, die creation of new strategies and structures, is random with 
respect to adaptive value. Innovations are not produced because they arc 
useful, they are just produced. If an innovation turns out ^ have adaptive 
value, it will be retained and spread through the population with high 
probability. In this sense, evolution is blind. How can this view be rec- 
onciled with the fact that human actors devote so much attention to pre- 
dicting the future and to developing strategies for coping with expected 
events? 

Most theorists assume that change in organizational populations is La- 
marckian, that major changes in the forms of organization come about 
through learning and imitation. Many kinds of organizations do devote 
resources to learning and espionage, often seeking to copy the forms of 
their more successful competitors. In a rough sense, organizations reproduce 
themselves either by setting up new organizations or by spinning off per- 
sonnel with the requisite knowledge to copy the form. Nelson and Winter 
(1982) have developed explicit models of such Lamarckian evolutionary 
change in populations of business firms. 

Another line of theory holds that change in evolutionary populations is 
more Darwinian than Lamarckian (Aldrich, 1979; Hannan and Freeman, 
1977, 1984; McKelvey, 1982). This work argues that inertial pressures 
prevent most organizations from radically changing their strategies and 
structures once established. Jt also argues that only the most concrete fea- 
tures of technique can be easily copied and inserted into ongoing organi- 
zations. Finally, it emphasizes density-dependent constraints on adaptation 
by individual organizations: although it may be in the interests of leaders 
of many organizations to adopt a certain strategy, die capacity of the system 
to sustain organizations with that strategy is often quite limited. Only a few 
can succeed in exploiting such a strategy, and "first-movers" have decided 
advantages. 

Even when actors strive to cope rationally with their environments, action 
may be random with respect to adaptation as long as the environments are 
highly uncertain or the conductions between means and ends are rot well 
understood. It is the match between action and environmental outcomes 
that must be random on the average foi Darwinian selection models to 
apply. In a world of high uncertainty, adaptive efforts by individual: may 
turn out to be essentiall -andom with respect to future value. 

The realism of Darwinian mechanisms in organizational populations also 
turns on the degree to which change in organizational structures can be 
controlled by leaders. Suppose that individuals leani to anticipate the future 
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and adapt strategies accordingly, and that organizations simply mirror the 
intentions of rational leaders. Then organizational adaptations would be 
largely nonrandom with respect to future states of the environment. On the 
other hand, if March and others arc right, organizational change is largely 
uncontrolled, and organizations staffed by rational planners may behave 
essentially randomly with respect to adaptation. In other words, organi- 
zational outcomes may be decoupled from individual intentions; organi- 
zations may have lives of their own. In this case it is not enough to ask 
whether individual humans learn and plan rationally for an uncertain future. 
One must ask whether organizations as collective actors display the same 
capacities. 

The applicability of Darwinian arguments to changes in organizational 
populations thus depends partly on the tightness of coupling between in- 
dividual intentions and organizational outcomes. At least two well-known 
situations generate loose coupling: diversity of interest among members 
and uncertainty about means-ends connections. When members of orga- 
nizations have diverse interests, organizational outcomes depend heavily 
on internal politics, on the balance of power among factions. In such 
situations collective outcomes cannot easily be matched rationally to chang- 
ing environments. 

When the connections between means and ends are uncertain, carefully 
designed adaptations may have completely unexpected consequences. 
Moreover, short-run consequences may often differ greatly from long-run 
consequences. In such cases, it does not seem realistic to assume a high 
degree of congruence between designs and outcomes. 

Thus, it may be useful in analyzing patterns of long-term change in 
organizational forms to supplement Larmarckian theories with Darwinian 
ones. The fact that members of organizations plan rationally for change 
and that organizations often develop structures designed to plan and im- 
plement change does not undercut the value of this view as long as orga- 
nizations are political coalitions and environmental change tends to be highly 
uncertain. 

Organizational Diversity 

An ecological-evolutionary approach directs attention primarily to or- 
ganizational diversity. It seeks to answer the question: Why are there so 
many (or too few) kinds of organizations? Addressing this question means 
specifying both the sources of increasing diversity, such as the creation of 
new forms, as well as the sources of decreasing diversity, such as com- 
petitive exclusion of forms. In other words, ecology of organizations 
seeks to understand how >cial conditions affect the rates at which new 
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organizations and new organizational forms arise, the rates at which indi- 
vidual organizations change structures, and the rates at which organizational 
populations die out. In addition to focusing on the effects of larger social, 
economic, and political systems on these rates, an ecology of organizations 
also emphasizes the dynamics that take place within organizational popu- 
lations. 

Questions about the diversity of organizations in society might seem to 
be of only academic interest. In fact, these issues bear directly on important 
social issues. Perhaps the most important is the capacity of a society to 
respond to uncertain future changes. Organizational diversity within any 
realm of activity such as medical care, microelectronics production, or 
scientific research constitutes a repository of solutions to the problem of 
producing certain sets of collective outcomes. These solutions are embedded 
in organizational structures and strategies. The key aspects of these solutions 
are usually subtle and complicated. In any large organization, no single 
individual understands the full range of activities and their interrelations 
that constitute the organizational solution. Moreover, the subtle aspects of 
the structure such as "climate" or "culture" defy attempts at formal en 
gineering specification. Therefore, it will often prove impossible to resurrect 
a form of organization once it has ceased to operate. If so, reductions in 
organizational diversity imply losses of organized information about how 
to adapt (produce) to changing environments. 

Having a range of alternative ways to produce certain goods and services 
is valuable whenever the future is uncertain. A society that retains only a 
few organizational forms may thrive for a time. But once the environment 
changes, such a sc iety faces serious problems until existing organizations 
can be reshaped or new ones created. Since reorganization is costly and 
may not work at all for the reasons stated above (and because new orga- 
nizations are fragile), it may take a long time to adapt to the new conditions. 
A system with greater organizational diversity has a higher probability of 
having in hand some solution that is satisfactory under changed environ- 
mental conditions. Adaptation to changing environments in such cases means 
mainly reallocating resources from one type of existing organization to 
another. 

The notion that diversity of organizational forms is a useful hedge against 
uncertain future changes in environments parallels a classic evolutionary 
argument. It is, for example, the same kind of argument that has been made 
against going overboard with the so-called Green Revolution in agriculture. 
The spread of single strains of crops implies a great reduction in genetic 
diversity, which may prove problematic if new kindj of pests arise to which 
the "miracle" crops are vulnerable. 

Organizational diversity affects society in another way. Since careers are 
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played out in organizations, the distribution of opportunities for individual 
achievement depends on the distribution of organizational forms. When 
diversity is high, individuals with different backgrounds, tastes, and skills 
are more likely to And o/ganiratioral affiliations that match their own 
qualities and interests. For example, die fact th?.t virtually every industry 
in the United States contains a sizable number ot sma 11 businesses allows 
ethnic and immigrant communities 10 create "ethnic encla .es' 9 within which 
to deveiop protected career paths. The presence of such niches in the 
economy, one kina of organizational di verity, has prowii crucial to the 
economic success Ot 4t * jast some ethnic communities. 

Diversity is also valued in ; ts own right. Consider the case of the daily 
press. It is widely agreed in this country that diversity of editorial opinion 
is a social good and ought not to be sacrificed to economies of business 
concentration in the industry. Similar views pertain to schooling, higher 
education, research laboratories, and all sorts of art-producing organiza- 
tions. 

How do social, economic, and political environments affect organiza- 
tional diversity? Almost all attempts to answer this question focus on the 
controlling role of uncertainty — stable and certain environments almost 
surely generate low levels of diversity. The main theoretical question is: 
How does environmental uncertainty affect diversity? 



One line of current research in sociology attempts to explain variations 
in organizational diversity within the context of what population ecologists 
call niche theory. The concept of niche is used in population ecology to 
refer to the set of coiulitions under which some form of life can perpetuate 
itself. The niche is a mapping between states of the environment and prob- 
abilities of expansion of numbers. Thus the niche summarizes the environ- 
mental dependence of a population. Much of the recent progress in 
bioecological theory has involved embedding naturalistic observations on 
the structure of niches in nature within Darwinian evolutionary theory (see 
Roughgarden, 1979). 

Much work on the evolution of the niche emphasizes niche width. Some 
forms, called generalists, persist under a very broad range of environmental 
conditions. Others, called specialists, thrive only in highly specific envi- 
ronments. Niche theories attempt to explain how patterns of environmental 
variations affect the evolution of niche widths in biotic communities, that 
is, how they affect the reproductive success of specialists and generalists. 

John Freeman and I have argued that many of the classic problems of 
environmental uncertainty and organizational structure can be recast prof- 
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itably as problems of organizational niche width (Freeman and Hannan, 
1983; Hannan and Freeman, 1977). Organizations clearly vary on this 
dimension. In our study of American labor unions, we find the Siderog- 
raphers Union, which seeks to organize a labor force that numbers in the 
hundreds (siderographers print currency and stock and bond certificates), 
and the Teamsters, who try to organize almost anyone who works. Likewise, 
the economy contains firms that produce a single product and others that 
produce a great range of products. The connection between niche width 
and diversity is straightforward. To the extent that social trends favor gen- 
eralist organizations, organizational diversity will decline. But, if specialist 
organizations have adaptive advantages, the society will contain many di- 
verse specialists. In other words, the dynamics of organizational niche width 
constrain organizational diversity. 

Theories of organizational niche width deal with a "jack-of-all-trades, 
master-of-none" problem. There are obvious trade-offs between the ca- 
pacity to withstand a wide range of environmental variations and the capacity 
for high levels of performance in any one environmental state. In part, 
these trade-offs concern organizational "slack*' or excess capacity. The 
ability to tolerate diverse environments requires the maintenance of nu- 
merous routines, patterns of activity that can be invoked by organizational 
subunits. As Nelson and Winter (1982) convincingly argue, organizations 
remember by doing, and the capacity to perform a routine declines rapidly 
with disuse. Generalists, who possess a wide repertoire, must devote con- 
siderable resources to simply maintaining the readiness of seldom-used 
routines . Because generalists must commit so many resources to maintaining 
and rehearsing routines, they sacrifice efficiency and effectiveness in per- 
forming any single routine. Therefore, at least some specialists usually 
perform better than generalists in any particular environmental state. Whether 
there are any adaptive advantages to generalism depends on the rate of 
environmental change and on the patterns of changes. 

Levins (1968) proposed a theory of niche width that seems applicable to 
these organizational questions: Optimal niche width depends on three fac- 
tors: (1) the magnitude of environmental variations relative to the adaptive 
capacity of the population, (2) the uncertainty of environmental changes, 
and (3) the grain of environmental changes. Certainty refers to the odds 
that the environment will turn up in any particular state; in a maximally 
uncertain environment each of the possible states is equally likely. Grain 
refers to the typical durations of environmental states. In fine-grained en- 
vironments durations are short relative to (unconditional) life expectancy; 
in coarse-grained environments, typical durations are long. The importance 
of this distinction is that fine-grained environments ought to be experienced 
(from a selection perspective) as a weighted average of the states. But 
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populations in coarse-grained environments cannot adapt to some average 
value; they must have the capacity to withstand long spells in any particular 
state. 

Freeman and I formalized the implications of Levins' model for death 
rates of specialist and generalist organizations in alternative regimes of 
environmental variations (Freeman and Hannan , 1983). The implications 
of this model agree in part with the existing literature but differ in one 
important respect. The conventional wisdom holds that uncertain envi- 
ronments always favor generalist organizations (see, for example, Katz 
andKahn, 1978: 131: Lawrence and Lorsch, 1967:8; Pfeffer and Salancik, 
1978; and Thompson, 1967:34-37). The model based on niche theory 
implies that uncertainty favors generalists only in coarse-grained envi- 
ronments. We tested this hypothesis using data on the lifetimes of res- 
taurant firms in 18 California cities. We find that the effect of uncertainty 
on the relative death rates of specialists and generalists does interact with 
grain in the predicted way. That is, the niche theory improves on existing 
theory in explaining the dynamics of niche width in this organizational 
population. 

Carroll (1985) has taken a slightly different approach to studying orga- 
nizational niche width. He points out that the life chances of specialist and 
generalist organizations depend on the density of each type in the environ- 
ment. Imagine a market with a center (high, concentrated demand) and a 
periphery consisting of pockets of heterogeneous demand. In the absence 
of competition, all organizations in the market concentrate on the center of 
the market. When the number of organizations competing in the market is 
high, the largest and most powerful generalists will typically dominate the 
center. If generalists are numerous, some of them will be forced to exploit 
more peripheral segments of the market. Because their size and power may 
allow them to outcompete specialists in the periphery, the life chances of 
specialists deteriorate when there are many generalists in the market. If, 
however, one or a few generalists come to dominate and push the other 
generalists completely out of the market, the opportunities for specialists 
to thrive in the periphery rise. Thus, concentration in a market should have 
the opposite effects on the life chances of specialists and generalists. As a 
market (or organizational field more generally) concentrates, the death rates 
of generalists will rise and those of specialists will fall. Analysis of death 
rates in populations of local newspaper firms supports this argument (Car- 
roll, 1985). 

Our research group is conducting additional research on the dynamics of 
organizational niche width among labor unions and semiconductor manu- 
facturing firms. A number of other groups are working on similar issues 
using different kinds of organizational populations. 
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INSTITUTIONAL ISOMORPHISM 

Organizational ecology also speaks to issues of structural isomorphism — 
the processes by which organizational structures become matched to features 
of the environment. Indeed, the initial work on the population ecology of 
organizations was stimulated partly by Hawley's (1950, 1968) classic ar- 
gument that organizational structures become structurally isomorphic to 
those organizations that control flows of resources into a local system. An 
example of this process is the spread through research universities of plan- 
ning and budget offices whose internal arrangements are close copies of 
those of the federal agencies from which unh sities obtain funding. 

Hawley's theory was silent on the processes that generate structural 
isomorphism. Hannan and Freeman (1977) argued that one route to iso- 
morphism is competition and selection at the population level — competitive 
isomorphism, as DiMaggio and Powell (1983) call it. But clearly this is 
not the only one. 

One important new line of sociological theory about organizations iden- 
tifies institutional processes that produce isomorphism. John Meyer and his 
collaborator (Meyer, 1978; Meyer and Scott, 1983) have argued that many 
features of organizational structure are symbols for competencies that may 
or may not exist. The important adaptive problem for organizations, es- 
pecially those producing products whose quality is hard to measure, is to 
evoke the appropriate symbols of competence. Moreover, organization builders 
are constrained by norms of rationality that dictate a limited number of 
routines and structures. 

As general societal processes of rationalization and state expansion pro- 
ceed, the set of available and endorsed building blocks becomes increasingly 
homogeneous, and organizational diversity declines. DiMaggio and Powell 
(1983:147-148) argue as follows: 

Bureaucratization and other forms of organizational change occur as the result of pro- 
cesses that make organizations more similar without necessarily making them more 
efficient . . . highly structured organizational fields provide a context in which individual 
efforts to deal rationally with uncertainty and constraint often lead* in the aggregate, 
to homogeneity in structure,. culture, and output. ... In the initial stages of their life 
cycle, organizational fields display considerable diversity in approach and form Once 
a field becomes well established, however, there is an inexorable push towards ho- 
mogenization. 

The argument that general norms of rationality and specific organizational 
agents (like the state, business schools, and professional associations) create 
pressures for structural homogeneity, and that these pressures are growing 
in strength is an important one. But there is a countertendency that must 
also be considered. 
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Assume that the population of individuals who demand and use services 
of organizations is heterogeneous. Assume further that organizational forms 
partly determine the character of organizational outputs. If, as the institu- 
tionalists claim, organizations are becoming more homogeneous, the frac- 
tion of demand for organizational outputs that is either unfilled or dissatisfied 
with current services will increase. This means that the gains from creating 
"deviant' ' Oiganizations to fill this demand will grow. This situation creates 
opportunities for "outlaw" entrepreneurs to experiment with new organi- 
zational forms. If any of the experiments are successful, the new forms 
ought to grow rapidly, lowering the homogeneity of the organizational 
population. 

It seems that industrial breakthroughs are often made outside institu- 
tional channels. The industrial giants in any one era often lack the fore- 
sight and flexibility to exploit radically new technologies and strategies. 
Sometimes new modes of production are inconsistent with standard op- 
erating procedures in the industry, producing conflict that drives creative 
individualr out cf giant firms into entrepreneurial ventures. The U.S. 
semiconductor industry provides an instructive example (Brittain and 
Freeman, 1980). Each of the large vacuum tube producers tried its hand 
at the production and marketing of semiconductor devices and failed. 
The industry became dominated by newly created firms. Almost 30 years 
into the history of the industry, there is still very rapid turnover in the 
list of leading firms. 

The crucial organizational innovations that create new industries and 
new goods and services arise mainly outside the highly institutionalized 
sphere. In fact, high levels of institutionalization may be a serious im- 
pediment to innovation. Understanding the forces that create organiza- 
tional diversity requires analysis of the social forces that shape attempts 
to create new forms of organizations and of the selection processes that 
apply to new forms. 

DISCUSSION 

An ecological-evolutionary perspective on organizational change rec- 
onciles the major insights of the two classic traditions of organizational 
theory and research. It assumes along with atonal-systems perspectives 
that organizations are designed as tools to achieve collective ends. Because 
organizations compete among themselves for scarce resources , membership, 
and legitimacy, efficiency in mobilizing each of these affects survival chances. 
In this sense, organizations face efficiency tests. However, the efficiency 
testing assumed in current ecological theory is much more complicated than 
simple testing for technical efficiency in producing some product or service. 
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Efficiency in mobilizing resources or in currying political favor may often 
be more decisive in affecting survival chances than narrow technical effi- 
ciency. Thus the 4 'rationality" involved in organizational selection pro- 
cesses may be considerably broader than that envisioned by Weber and 
Taylor. Nonetheless, ecological-evolutionary perspectives pay explicit at- 
tention to efficiency testing, broadly defined. 

Ecological-evolutionary perspectives also build on the natural-systems 
notion that organizations take on lives of their own. Because organizations 
must delegate decisions to human actors, they cannot escape processes of 
political conflict and the creation of subgroup norms and loyalties. Initial 
patterns of action tend to become enduring bases of political bargaining 
and of group loyalties. Subsequent attempts to change drastically the struc- 
ture of an organization encounter both self-interested political objections 
from subgroups who will lose resources and normative objections to chang- 
ing rules and structures that have become infused with symbolic value. For 
these and other reasons, organizations seldom function exactly as planned 
and are very difficult to reshape. 

Inertia and change in the composition of organizational populations over 
time are easy to reconcile within a population perspective. New organi- 
zations enter many organizational populations at a reasonably high rate. 
These new entrants are often the carriers of new strategies and structures, 
the main source of diversity of forms. At the same time, existing organi- 
zations drop from sight either by simply disbanding or merging with other 
organizations. The merger process seems to have become the main vehicle 
by which large and powerful organizations cease to have independent effects 
on the society. Thus, the populations of business firms, government agen- 
cies, and others have changed over time in response to the creation of new 
firms and agencies carrying new forms and following new agendas. They 
have also changed in response to mergers among existing firms and agen- 
cies. 

This view of organizational change directs attention to social policies 
that affect the rate of creation of new organizations. It suggests that dis- 
cussion of industrial policy, for example, should pay less attention to es- 
tablished, giant firms than to the social, political, and economic processes 
that affect the rate at which new firms are started and the life chances of 
new finns using tnnuvaUve strategic* aiid structures. More generally, it 
points to the importance of organizational diversity to society and empha- 
sizes the need to better understand how social policies affect such diversity. 



Many of the ideas discussed here were worked out in collaboration with 
John Freeman. Susan Olzak made helpful comments on an earlier draft. 
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ORIGINS OF THE SUBJECT 

Historical research may uncover some obscure roots or primitive example 
of macroeconomic models, estimated from numerical records and meant to 
be realistic, but I doubt that anything clearly in the spirit of the present 
effort can be found prior to the investigations of J. Tinbergen, first for his 
own country, the Netherlands, in the early 1930s, and later for the League 
of Nations' analysis of the economy of the United States, intended for a 
better understanding of the Great Depression (Tinbergen, 1939). 

Tinbergen's contribution was to show that the essence of die macro 
economy could be expressed compactly in an equation system, that thf je 
systems could be fit to real-world data, and that revealing properties of the 
economy could be analyzed from such systems. To a large extent, Tinbergen 
was interested in the cyclical properties of the system. That was his main 
reference point in studying the American economy in the context of the 
stock market boom of the 1920s, followed by the Great Crash and recovery 
during the Great Depression of the 1930s. He was plainly impressed and 
inspired by the implications of the Keynesian Revolution, but his greatest 
work on econometric models, that of the United States for the League of 
Nations, was never put to practical use in the national scene. His model 
estimation for the Netherlands formed the basis for Dutch postwar economic 
policy implementation at the Central Planning Bureau, which he directed 
after World War II. 

There was a hiatus, naturally, caused by the war, but during the closing 
months of 1944, J. Ma^chak assembled a research team at the Cowles 
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Commission of the University of Chicago. The Cowles Commission was 
founded end supported by Alfred Cowles, originally for the study of security 
prices and eventually for the investigation of mathematical and statistical 
methods in economics. I was recruited for the Cowles team for the express 
purpose of taking up the thread of work initiated by Tinbergen. Several 
lines of thought converged at the Cowles Commission during the middle 
and late 1940s. These were: 

• The concept of a mathematical model of the macro economy 

• An emerging theory of econometric method 

• A growing body of statistical data on the national economy 

The macro model concept built on the intense intellectual discussions 
among economists about the interpretation of the theories of J.M. Keynes 
in The General Theory of Interest Employment and Money. The mathe- 
matical formulations of that theory by J.R. Hicks (1937) and O. Lange 
(1938) formed the basis for a whole new way of thinking about the aggre- 
gative economy. F. Modigliani had written a provocative article extending 
the Keynesian analysis (Modigliani, 1944), and I had just completed the 
dissertation draft of The Keynesian Revolution in 1944. The mathematical 
models in these writings lent themselves well to manipulation to study the 
movement of principal magnitudes of the economy and were formulated in 
terms of measurable concepts. It was essentially a case of 'actors in search 
of a play." The Keynesian theory was simply crying for econometric im- 
plementation. 

In a miniature, condensed, and aggregative sense, the Keynesian theory 
was a simultaneous equation version of the Walrasian system for the econ- 
omy as a whole. In the nineteenth century, L. Walras, professor at Lau- 
sanne, formulated a view of the economy as a result of the solution of a 
set of simultaneous equations . His set referred to the detailed micro economy 
and, conceptually, to an enormous system of n equations in n unknowns, 
where n is as large as the goods and services of an economy are numerous, 
i.e., in the millions, or billions, or even larger. At a macroeconomic level 
the Keynesian economic theory recognized the simultaneity clearly. For 
example, it was noted that aggregate income affected aggregate spending 
in the economy, while at the same time aggregate spending affected the 
generation of aggregate income. But statistical practice did not take this 
simultaneity properly into account. This idea was exploited by T. Haavelmo, 
much inspired by A. WaJd, who contributed a great deal of the statistical 
thinking — as far as probability and the laws of inference are concerned — 
and also the dynamics. Two important papers were produced that shaped 
the statistical approach of the Cowles Commission team (Haavelmo, 1943; 
Mann and Wald, 1943). This approach has not flourished as much as the 



y& ice 



MACROECONOMIC MODELING AND FORECASTING 



97 



approach of building macroeconometric models for practical and theoretical 
application, but it was instrumental in providing a deep understanding of 
econometric method. The statistical approach was a moving force in the 
formative days, even though it is not preeminent at the present time. 

The actors and the play came together in the actual statistical calculation 
of models. They began to emerge as early as 1946 and were first used by 
the Committee for Economic Development in assessing economic prospects 
for the postwar world. The emphasis was different from Tinbergen's. The 
principal goal was to build models in the image of the national income and 
product accounts for the purpose of making forecasts and for guiding eco- 
nomic policy. The kinds of policy formulations implicit in Keynes's Genera! 
Theory were plainiy operative in these systems. 

A PERIOD OF EXPANSION 

The history of the development of models during this period and in the 
subsequent two decades or more is traced by Martin Greenberger et al. 
(1976) and Ronald Bodkin et al. (1980). It consists of tracing the models 
from the Cowles Commission at Chicago, to the Uni/ersity of Michigan 
(Ann Arbor), Canada, the United Kingdom, and elsewhere up to the present 
day, when there are literally hundreds, in daily operation all over the world. 
The major actors in this history were, in addition to the author, Colin Clark, 
Daniel Suits, Arthur Goldberger, R.J. Ball, Otto Eckstein, J. Duesenberry, 
and Gary Fromm (Clark, 1949; Klein and Goldberger, 1955; Suits, 1962; 
Klein et al., 1961; Duesenberry et al., 1960). 

During the period of enthusiastic development at the Cowles Commission 
it was thought that applications of the most sophisticated and powerful 
methods of statistical analysis would provide a breakthrough in practical 
accomplishments, but complexity of computation remained a bottleneck. 
Only demonstration models could be given a full statistical treatment, and 
that was very laborious. 

The use of cross-section data (surveys of individual economic units — 
households and establishments), and finer units of observation in time series 
(quarterly and monthly data), were looked to as other routes for a break- 
through. Cross-section data were • 'noisy " although revealing. Monthly and 
quarterly data were highly serially correlated but helpful in cyclical analysis. 
Trend data, in the form of decade averages over long periods of time, were 
also examined for possible leads, but they were too smooth to make an 
enormous difference. 

In the early 1960s a breakthrough did occur, in the form of the electronic 
computer, which was harnessed to the needs of econometrics. In the 1950s 
there were some early attempts at massive computer use, which worked 
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well for selected aspects of econometric computation; but it was only through 
the use of the computer in successive stages that very significant achieve- 
ments were realized. It is noteworthy that at a 1972 conference in honor 
of John von Neumann's contributions to the development of the electronic 
computer, held at the Institute for Advanced Study in Princeton, most of 
the reports claimed that the computer was of only moderate significance in 
advancing scholarly subjects in the sciences. Some of the contributions 
bordered on the cynical side But economics was an exception. In my own 
paper, I claimed that the computer absolutely transformed research in quan- 
titative economics and in econometric model building in particular. 

The use of cross-section and sample survey observations in general has 
a long history in econometrics, both theoretical and applied. These obser- 
vations were used more for the study of building blocks at the micro level 
of analysis, but as the computer became more available for econometric 
research, it became more plausible to build models that evolved into com- 
plete systems from the micro units. Th^se are micro simulation models, 
introduced at an early stage by Guy Orcutt (Orcutt et al., 1961). Large 
segments of the total economy, if not the entire macro economy, can be 
modeled in this way. Such systems are not as widespread in use as are 
aggregative models of the economy as a whole, but progress in provision 
of data, techniques of data management, and system computation will enable 
model building to go in the direction of micro simulation systems in the 
future. At thj very least, they will be revealing for the study of particular 
aspects of human behavior patterns because they get right down to the basic 
agents of behavior. 

By 1964 it was possible to compute full-information maximum likelihood 
estimates of equation systems with 20 or more equations. The usual ex- 
amples had dealt with systems of 3, 4, or 5 equations using hand methods. 
The computational problem, as far as statistical method in econometrics 
was concerned, was fully resolved during the 1960s. Programs for esti- 
mation in nonlinear systems, autoregressive correction, estimation of dis- 
tributed lags, ridge regression, generalized regression, and many other 
estimation problems were made available for routine use. All the bottlenecks 
that had appeared earlier were suddenly broken. Econometric scholars were 
able to handle data much better and explore data much more extensively 
in the search for good estimates, but there was no seeming increase in 
accuracy, efficiency, or applicability of econometric models from this par- 
ticular line of research. The next real breakthrough came in connection with 
some research on the Brookings model for the macro economy of the United 
States. This was a quarterly mode! put together by a team of specialists 
working under the auspices of the Committee on Economic Stability of the 
Social Science Research Council. The work started in 1960, but a model 
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was not available until 1963 and was then transferred to the Brookings 
Institution. 

The principal problem was the computation of a solution to a system of 
some 300 equations, which seemed very large at the time. After much 
detailed experimentation, a method of solution was found in 1965. It was 
a form of a well-known Gauss-Seidel figorithm. which proceeds by iterative 
substitution of partial solufc :° from one equation to the next in a long 
succes ion through a whole sy^em. This method . .is tedious but inexpen- 
sive and fast on an electronic computer. Although it involved a very large 
number of calculations, it was accurate, efficient, and workable. It is now 
a routine method used worldwide for solving systems of simultaneous equa- 
tions in economics. Once the method had been streamlined for either non- 
linear or linear econometric models, the technique of simulation was 
extensively developed for the analysis of numerical models. 

Use of this instrument was a breakthrough in the following senses: 

1. Systems of any manageable size could be handled. A system of 100 
equations was considered modest (for the first time), and systems of thou- 
sands of equations are frequently used. Manageability is governed by the 
capability of human operators to collect data, have it ready for use, and 
understand the working of the system. 

2. Economic concepts such as multipliers became generalized into al- 
ternative policy simulations, scenarios, dynamic or static solutions, or sto- 
chastic simulations. All of these enabled workers in the field to do much 
more with models, to understand their dynamics and sensitivities. For policy 
application in both private and public sectors, extensive simulation analysis 
is essential. 

3. Frequency response characteristics of dynamic systems could be stud- 
ied. Eigenvalues in linear or line* ized systems could be studied; methods 
of optimal control could be applied. 

4. The presentation of econometric results for the noneconometrician or 
even the noneconomist became possible. Solutions of abstract mathematical 
systems could be presented in the form of instructive accounting tables, 
graphical displays, time patterns, and condensed reductions of large, com- 
plicated systems. 

5. Error statistics could be computed. With stochastic simulation meth- 
ods probability limits on forecast error bands or regions could be evaluated. 
Extensive recordkeeping for model performance became possible. 

The forms of calculation and analysis just listed were always possible. In 
linear systems they could be expressed in closed form relationships for most 
cases, but they could never have been done on a significant scale and would 
have attracted only very few patient individuals to the field of applied 
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econometrics. Those of us who toiled for 2 or 3 days to make one alternative 
analysis of a 20-equation system still marvel at the hourly accomplishments 
of the computer with a model. We could never have dreamed, in 1945, 
that we would be using models so intensively, so extensively, with such 
an audience, or with as much precision as is real ; zed today. The computer 
is the key. 

At the present time econometric software takes one from beginning to 
end ("cradle to grave"). Data come in machine-readable form from the 
primary, usually official, source. The data are managed by software de- 
signed to take out seasonal variation, arrange in order for computation, 
correct for inflation, and form transformations (ratios, sums, logarithms, 
etc.). Estimation routines are put to work directly from updated data files. 
The estimated equations are screened and selected ior use in models. Sim- 
ulation routines arrange the equations for dynamic solution in various forms 
(stochastic, deterministic, steady state, short run). All these things are done 
internally — within the computer. Finally tables, charts, and graphs are 
generated as reports. This is the patterned sequence for the present computer 
age. 

Every breakthrough has its drawbacks. Today's econometrician can make 
horrendous mistakes because the computer stands between the original 
material and the final results. Modern investigators do not look at each 
sample value with the same care that older econometricians used when the 
data were personally handled. A certain number of initial mistakes or non- 
sensical results have to be tolerated as payment for the enormous amount 
of good material that is generated. Only the **old hands" know where to 
look for troubles and avoid pitfalls, because the typical modern investigator 
does not want to search into or behind the computer processing unit. 



The computer is a facilitator but it does not guarantee good analysis or 
usage, enabling us to produce econometric findings on a large scale, pre- 
sentable in convenient form. It is now time to consider the impact of this 
effort, not particularly from the viewpoint of the immediate users, but from 
the viewpoint of scholarly thought. 

Econometric methods are often used to test economic theory. The models 
are themselves often, and preferably, based on received economic theory. 
Whether or not they fit the facts of economic life should tell us something 
about the validity of the underlying theory. 

Some direct tests of economic theory have been decisive, but we often 
come up against the fact that the data of economics , which form the sampling 
basis for econometric inference, are not sharp enough or abundant enough 



CONTRIBUTION TO THOUGHT 



ERIC 




MACROECONOMIC MODELING AND FORECASTING 



101 



o come to decisive conclusions in many cases. More than one hypothesis 
is often consistent with a given body of data. 

We have, however, been able to reject the crudest and most simplistic 
of theories. The data, and tests based on these data, have been conclusive 
in rejecting crude acceleration principles or the crude quantity theory of 
money: 

I, = aQ + e, 
M t = k [GNP ($)] t + Ul 

where 

I, = net real investment in period t 

C t = rate of change of real consumption during period t 
M t = money supply at time t 
[GNP ($)] t = nominal GNP during period t 
e,, u t = random errors 
a,k = parameters 

If we hypothesize that I t is proportional to C t apart from an additive 
random error that is obtained from a fixed distribution with finite variance 
and no serial correlation, we would find that the data do not support this 
model. 

It is, of course, a simple model, and if it is generalized by replacing 
Q by total real output and introducing other variables such as capital cost 
and capital stock we get the generalized accelerator, which does appear to 
fit the facts. That does not mean that we have proved that the generalized 
accelerator is the only investment function that fits the facts; indeed, there 
are others that are consistent with observed data, but we have been able to 
reject the crude, and original, version of the accelerator hypothesis. 

The same is true of the crude quantity theory. Either Milton Friedman's 
generalization, by introducing distributed Lgs in the price and real output 
factors (components) of [GNP ($)], or some extended liquidity preference 
version of the Keynesian theory both fit the facts. We cannot discriminate 
definitively between the monetarist hypothesis of Friedman or the portfolio 
hypothesis of Keynes, yet we have been able to reject the crude form of 
the theory. 

There are many examples like these, but it can legitimately be argued 
that such testing does not take us very far because it leaves an embarrassingly 
large number of competitive hypotheses still standing as possible expla- 
nations of reality. When we go beyond the simple single relationship, 
however, to fully determined models of the economy as a whole, we find 
that it is not easy to put together an entire set of relationships that explains, 
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to an acceptable degree, the evolution of the macro economy. It is, ad- 
mittedly, difficult to settle on a single set of compost itionships for 
the economy as a whole, yet it is possible to find more a one that passes 
the conventional statistical tests. 

The econometric approach should be used, therefore, painstakingly and 
slowly, to sift through this multiplicity of systems in replicated applications 
and to test them by the strength of their predictive powers. It is usually the 
case that particular models will wi.rk well on one occasion or another, but 
it is not at all easy to find a system that will stand up to predictive testing 
period after period. If an estimated model does predict well in repeated 
applications, then we get some evidence about the validity of the hypothesis 
on which it is based. 

Some macroeconometric models that were thought to rest firmly on 
accepted hypotheses , such as the St. Louis model , the Fair model , or various 
supply-side models, did so poorly in predictive testing that they were deemed 
failures (McNees, 1973; Fair, 1976; Andersen and Carlson, 1976). 1 Mon- 
etarist economists were enthusiastic about their hypotheses in the late 1 960s , 
but when the St. Louis model, which was based on those hypotheses, came 
up against the oil embargo, OPEC pricing, food price inflation, and the 
automobile strike of 1969, its operators declared that it was not meant for 
short-run prediction of the economy. This statement contradicted their orig- 
inal hypotheses. Eventually the monetarists contended that the model could 
be used for long-term but not for short-term analysis. In this case, it appeared 
to fit the data of the economy for awhile, but with repeated use, observations 
emerged that were not consistent with its underlying theory. The same is 
true of the original Fair model. 

As for the supply-side models that were introduced in the late 1970s, 
they were never tested against the facts, but when they were estimated and 
confronted with the data of 198 1-1982 they failed to stand up as maintained 
hypotheses. More conventional models correctly predicted that this would 
be the outcome. 

Given only limited success for macroeconometric model analysis in test- 
ing economic theory, how has it otherwise contributed to economic thought? 
The main contributions have been to decisionmakers, the users of econo- 
metric information in the public and private sectors. They are usually ad- 
ministrators and executives in large organizations and enterprises. Legislators 
also fall into this group. In a sense, the utility of these models is indicated 
by the degree of their use. There are systematic records of their forecast 



! Thc use of supply-side models is quite new; fully documented citations will not be ready for 
a few more years. 
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history but no systematic records of their performance in decisionmaking. 
Since only one outcome is actually observed, it is impossible to judge their 
accuracy with respect to unobserved alternatives that tend to be considered 
by the decisionmaker in reaching a choice. 

Decisionmakers say that they make better choices than they would oth- 
erwise be making without the use of such models, and that econometric 
models are the only tool available in a large variety of situations. This is 
why econometric model-building activity is expanding so rapidly all over 
the world. 

To a large extent, the first models were national in scope and fitted 
together with the emerging body of data on national income and product. 
But many subnational and supranational macroeconometric models, for 
industries, markets, or regions of a nation, are now either in use or being 
built. Many of these are designed in integrated feedback mode with com- 
prehensive macro models, while many are also built in satellite mode, 
without feedback. 

At the international level we now have many models connecting different 
countries and regions in a macro world system. One of the first of such 
systems was designed as an international trade model by J.J. Polak (19S3). 2 

In 1969 Project LINK was initiated, with the objective of consistently 
tying together models of the main trading nations to analyze the international 
transmissior lechanism (Ball, 1973; Klein et al., 1982). The project now 
consists of the interrelated network of models for 25 industrial countries — 
the Organisation for Economic Co-operation and Development — (OECD), 
8 centrally planned economies, and 4 regions of aggregative developing 
country models. Work is under way to add more than 25 models of indi- 
vidual developing countries. Project LINK is approaching, in the true sense 
of the word, the status of a world model. 

After the implementation of LINK in analyzing the world economy for 
such things as oil price shocks and predictions of various global totals, 
many other supranational systems have been designed: INTERLINK, by 
the OECD; the TSUKUBA-FAIS Model by Tsukuba University, Japan; the 
Multicountry Model of the Federal Reserve Board; the World Economic 
Model of the Economic Planning Agency, Japan; the FUGI Model of Soka 
University, Japan; and the World Model of Wharton Econometrics. These 
systems vary according to size and focus. Some concentrate on exchange 
rates and balance of payments flows; others are principally concerned with 
trade. Some emphasize the long term; others the short term. Nevertheless, 
they all have a common interest in the global outcome of economic activity 



2 Othcr early models were COMET and DESMOS (see Barter), 1981; Dramais, 1981). 
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and will be used with increasing frequency in an economic world that is 
becoming increasingly interdependent. Most of these systems were devel- 
oped during the past 10 years, and it is evident that many more will be 
developed in the period ahead. 
Specifically, the systems are used to study: 

• the exchange rate system 

• trade liberalization or protection 

• world business cycle analysis 

• worldwide disturbances (oil, food, raw materials) 

• international debt problems 

• policy coordination among countries 

• transfers of capital 

• international migration 

As new, equally pressing, issues arise the models will be adapted to their 
analysis. 

SOME NEW LINES OF DEVELOPMENT 

Econometric model building in the computer age has moved in the di- 
rection of the large-scale (1,000-or-more-equation) system with many sec- 
tors, rich dynamics, nonlinearities, and explicit stochastic structure. It has 
never been viewed as an issue of "bigger is better"; it is mainly an issue 
of detail. In large, detailed systems of this sort a main interest has been 
the development of scenario analysis. This procedure generalizes the entire 
concept of the multiplier, which is meant to show the relationship between 
any particular endogenous variable iy xX ) and any corresponding exogenous 
variable (Xj t ): 

dy- 

— other x's unchanged 

dx jt 

This general expression includes the original Keynesian multiplier 

dGNP = 1 
dl 1 - mpc 

mpc = marginal propensity to consume 
GNP = real gross national product 
I = real investment (exogenous) 

This simple multiplier expression is designed to show the GNP that would 
be generated by an increment in fixed investment. For most countries the 
GNP gain would outstrip incremental investment, making the multiplier 
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greater than 1.0. Conventional wisdom, derived from large-scale econo- 
metric models of the United States, indicates that the multiplier, after taking 
a much more elaborate system structure into account, is about 2.0 after a 
period of about two years' sustained stimulus to investment. 

The scenario, as distinct from the multiplier, simply imposes changes 
on a model. These changes can be at any place and any time; they may 
alter any element or group of elements. Computer simulation enables the 
investigator to compare a baseline or reference solution to a model system 
with a scenario solution. Scenarios can be quite creative — the scenario of 
disarmament, of harvest failure, of embargo, of stimulative policy, of tech- 
nical progress. Investigation is virtually unlimited. It is important to have 
a credible and fully understood baseline solution; the scenario then produces 
a discrepancy that reflects the investigator's creative inputs. This can be 
quite revealing and is of the greatest importance for policymakers, deci- 
sionmakers, or interested economists. There is unusual interest in a model's 
forecasts, but there is as much interest in scenario analysis. Scenario analysis 
permits rapid response to changing situations, such as abrupt shifts in policy, 
embargoes, strikes, and natural disasters. It is also the natural tool for 
planning. 

Jt is evident that the rapid, frequent, and flexible implementation of 
scenarios would not have been possible without the use of the computer. 
In addition, the report-presentation capabilities of the computer enable the 
model operator to communicate final results with a high degree of expo- 
sitional clarity. These applications on a large scale have been available for 
only about 10 or 15 years. They are undergoing further development, 
particularly for use with microprocessors. From the point of view of de- 
velopment of thought, this aspect is likely to be mainly of pedagogical 
significance. 

But the use of scenario analysis in the formulation of economic policy 
is leading in another direction that does have some methodological and 
theoretical significance for econometrics. The formal theory of economic 
policy was introduced some 30 to 40 years ago by J. Tinbergen and others. 
Tinbergen drew a distinction between targets and instruments of policy. 
The former are the subgroup of endogenous variables that policymakers 
want to have at specified values some time in their planning horizon, while 
the latter are the items of control among the exogenous variables that 
policymakers can influence. For example, bank reserves are instruments 
affected by the Federal Reserve system's open market operations that are 
fixed at certain values in order to achieve policy targets such as various 
money supply aggregates. The formal theory establishes the choice of in- 
struments in relation to target goals in the framework of a loss or gain 
function that the policymakers attempt to optimize, subject to the constraints 
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of the working of the economy, represented by a macroeconometric model. 
We are not, by a long shot, near the point at which policy can be routinized 
through this optimization process. We are not able to do what scientists 
and engineers accomplish with applications of optimal control theory to the 
operation of physical systems such as a boiler. We have borrowed many 
useful ideas from the control theory literature, and the development of this 
stream of econometric analysis is providing a deeper understanding of the 
workings of models, especially inputs for exogenous variables over distant 
horizons. These inputs are those that keep the system close to some a priori 
targets. 

Control theory calculations may become difficult if used in large-scale 
systems. This has been the case, particularly, in large international systems 
with multimodel components, as in Project LINK. For this reason simplified 
and small systems are frequently used to facilitate the control theory ap- 
plications to econometrics. This use is not meant to bring control theory 
to the needs of actual policy application. Eventually, the cumulation of 
knowledge and even farther computational advances will make control 
theory methods more suitable for use in large, state-of-the-art models. 

Despite the advances in econometric model building and the growth in 
its use, there are skeptics among professional economists. Some argue that 
models are not sufficiently accurate although users do seem to appreciate 
their accuracy to the extent that they find them important elements in their 
own tool kits. They will undoubtedly continue to use macroeconometric 
models unless and until something better, cheaper, and more convenient is 
made available. 

For some time, analysts who work with different systems have made 
claims of either superiority, equality, or cheapness, but they have not 
produced convincing evidence to support their claims. In some respects, 
time-series models, which are based purely on sample data with no (or 
minimal) underlying economic theory, claim to be an alternative. At present, 
it may be said that time-series models, in single equations or systems of 
equations, produce forecasts that are at least as good as macroeconometric 
models for short time horizons, say up to six months or less. Time series 
models do not provide a flexible vehicle for scenario analysis, but they do 
provide forecasts. 

The contest between time series and macroeconometric models will con- 
tinue on its present course, and it is plausible to believe that each has 
something to learn from the other, but there is also another possible route 
in which they may be developed together for construction of a better system. 

It is first necessary to digress for the explanation of a controversial issue 
in connection with application of large-scale models. After a model is 
estimated from a typical sample of economic data, it is invariably found 
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that extrapolations outside the sample lose accuracy quickly. For (me year or 
two, a freshly estimated model may stay close to economic reality, but it has 
never proved possible to estimate a model from a time-series sample and use 
that model in extrapolation in an automatic way with zero (mean) values 
assigned to the stochastic error temis. All such attempts have failed to pass 
forecasting tests outside the sample. The reasons for this failure are: 

L data revisions between the estimation period, within the sample, and 
the extrapolation period; 

2. legal or institutional changes in the functioning of the economy; 

3. the occurrence of an unusual disturbance (war, strike, embargo, nat- 
ural disaster); 

4. temporary behavior drifts. 

One way of dealing with some of these problems in small systems is to 
reestimate the model every period (as often as every quarter) before extrap- 
olation. This is entirely possible in small systems but not in the large models 
presently in use. Instead, common practice is to estimate nonzero values for 
the stochastic component to bring the model solution close to reality for the 
initial period before extrapolation. In general, this period will be one outside 
of the sample. We line up the model so that it starts an extrapolation period 
at the observed values. Its reaction properties are left unchanged unless there 
has been a statutory or other known structural change. 

The adjustments are made to equations on a nonmechanical basis, and 
the criticism of model builders' practices is that they are changing their 
systems prior to application by a method that is not purely objective. (It 
may be better to say, "by a method that is not purely automatic," because 
objective criteria are used in choosing the nonzero values for the stochastic 
terms.) 

A suggestion by Clive Granger, who is an exponent of time-series meth- 
ods, may indicate a fruitful alternative that is automatic. Granger suggests 
that forecasts be made from linear or other combinations of models in order 
to spread risk in an uncertain exercise. A different combination is the 
following: Time-series equations can be estimated for each endogenous 
variable of a model. These can be automatically recalculated on an up-to- 
date basis as new observations are made available. Error values in an 
extrapolation period can be assigned to each equation of a model so that 
the time-series estimate of the normalized endogenous variable for each 
equation is obtained. 3 In other words, the model is automatically and ob- 
jectively adjusted to "hit" the time-series estimate of the value of each 



3 A normalized variable is the variable that carries a unit coefficient :n each stochsstic equation. 
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endogenous variable in the initial solution period. This adjustment is then 
retained for the balance of the extrapolation horizon. 

Variations on the pure time-series model are provided by the so-called 
"current quarter model," a system in which pure time-series estimates are 
supplemented by estimates that make use of early reports on the macro 
economy by using daily, weekly, or monthly advance estimates of key 
magnitudes. A strong current-quarter model may provide better estimates 
of initial values of endogenous variables than can a pure time-series model. 

This area of research for improving the adjustment procedure is one that 
is presently receiving attention and appears to be promising. 

Other lines of development are in variable parameter models and in 
generalized models that transcend the narrow scope of purely economic 
relationships. 

Parameters may vary over time, systematically or randomly. Methods 
for dealing with these models have been proposed from time to time. There 
is no immediate breakthrough in sight, but it is an area that merits much 
attention. 

As for enlarging the scope of models, we have attempted to bring in 
more policy variables and introduce political reactions. This is particularly 
the case for intervention in the foreign exchange markets in this era of 
floating rates. 

Economists tend to draw a line and delegate responsibility to demogra- 
phers for birth, de^th, morbidity, and other population staristics; to cri- 
minologists for crime statistics; to psychologists for attitudinal variables; 
to political scientists for voting magnitudes, and so on. Usually, these 
external magnitudes are classified as exogenous variables, but many interact 
with the economic system in feedbacK relationships. Over the years the 
scope of endogeneity has expanded. Many models generate age-sex-race 
distributions of labor force, employment, and unemployment. The under- 
ground economy, theft losses, and statistical discrepancies have not yet 
been integrated with criminology theory, for example, but many possibilities 
exist for going much further in the endogenous treatment of important 
variables. Much more of the public sector is now endogenous than in the 
earliest Keynesian models. The social science, legal, and engineering as- 
pects of models need fuller integration and are likely to be handled that 
way in the future. 
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Numbers and Decisionmaking 



Public Statistics and 
Democratic Politics 



KENNETH PREWITT 



If, to paraphrase Harold Lass well, politics has become how much for 
how many, it is cle?r that measurement moves toward the center of political 
life. The result is a . olitics of numbers — What is to be counted? By whom? 
Can the numbers be trusted? In which direction is the trend line moving? 
Who is at fault for the (now numerically defined) failure of a policy or 
program? The intrusion of numbers into politics is global, as the world's 
nations now endlessly debate issues couched in numerical estimates and 
forecasts: weapon counts, oil reserves, trade balances, North-South ine- 
quities, debt ratios. 

With reason, then, scholars have focused their attention on how numbers 
are generated and subsequently used or misused in politics. This important 
scholarship rests on the assumption that public statistics are not politically 
neutral. Decisions about what to count are influenced by the dominant 
political ideologies, and numbers enter the political fray on behalf of social 
interests. 

The approach adopted in this essay accepts this assumption but focuses 
it as follows: public statistics in the United States are generated as part of 
democratic politics. This invites inquiry into the ways in which this par- 
ticular nation's "number system" advances or retards democracy, informs 
or distorts civic discourse, helps or hinders political participation. For just 
as public statistics are not neutral with respect to the everyday politics of 
group interests, so they are not neutral with respect to the principles and 
practices of democracy. Consequently, to study constitutional democracy, 
as it is today practiced in the United States, requires a perspective on 
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numerical reasoning and the nation's number system. Providing this per- 
spective is a task for social theory. 

There are of course unresolved issues in what does, or should, constitute 
democracy in the United States. We cannot attempt here to sort out the 
relative emphasis that contending theories of democracy give to such issues 
as popular particip?* ; on, economic and social equalities, the protection of 
property, civil liberties and citizen rights, or democratic procedures. In this 
chapter we take the simpler route of concentrating on two central issues: 
accountability — how public leaders are held accountable for their perfor- 
mance in office; and representation — how diverse interests are represented 
in setting the political agenda. 



The centrality of the concept of accountability in democratic theory de- 
rives from the observation that democracies no less than other forms of 
government have public officials with immensely more power than average 
citizens. Democratic theory does not deny the power advantage enjoyed by 
those in charge of the government, nor does it optimistically presume that 
democracies are free of 'he tendency of power-holders to expand their 
control. Embedded in a democracy, again no less than in other forms of 
government, is a structure of bureaucratic and political power. 

The task of democratic theory is to direct us toward practices that rec- 
oncile the inclination of power systems toward dominance with the dem- 
ocratic ideal of popular sovereignty. The basic terms of this reconciliation 
are to be found in the Constitution, especially in the provision for separating 
and fragmenting official power so that leaders can check and control each 
other, and in the companion provision that regular electoral competition 
will force leaders to contest with each other for the favor of the votcio. 

The general idea of this second provision is summarized in the phrase 
"theory of electoral accountability" as first adumbrated in The Federalist 
Papers and subsequently elaborated by Schumpeter and other democratic 
theorists. There is competition for public office. Leaders present themselves 
and their records to the electorate. Voters, basing their judgments on the 
past performance or estimates of future performance of leaders, elect, re- 
elect, or evict accordingly. Leaders, knowing this, and wanting to gain and 
retain office, promote policies that will attract public support. 

This theoretical formulation is a reasonably accurate though partial de- 
scription of what, in fact, does happen. The empirical evidence has been 
most compellingly presented by political scientist Morris Fiorina (1981; 
also Kramer, 1971), who has demonstrated the use voters make of retro- 
spective evaluations. Voters routinely reject incumbents who governed dur- 
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This application of numbers to the purposes of democratic accountability 
occurs in a period when many other political developments undermine the 
conditions necessary for holding public officials to account: the decline of 
party discipline, even of political parties themselves; the increased costs of 
electioneering and the related packaging of candidates by media experts; 
the growing political influence of single-issue interest organizations; the 
comparatively low rates of political participation. These trends occur as the 
political agenda is ever more crowded with issues difficult for the average 
citizen to comprehend. A weakened party-electoral system combined with 
a crowded and complicated issue agenda is not conducive to democratic 
accountability. "Against this background, it is all the more important to 
understand whether numeric descriptions of major social conditions and 
trends can improve the reasoning capacity of modern democracies. 

The hypothesis can be generalized. Just as a particular administration in 
power can be evaluated by statistical trends, so also can broad social pol- 
icies. In this generalized version, citizens continually evaluate and reev- 
aluate broad policy commitments made by previous political generations. 
In modern nation -states, this retrospective public reflection is facilitated by 
measures of long-term trends. Descriptive statistics, especially when pre- 
sented as trend lines, offer voters befoie-and-after information about the 
performance of incumbents as well as of general policies. Consequently, 
these statistics contribute to the procedures that establish accountability in 
democratic politics. If we could leave matters at this point, the story would 
be a welcome one for democratic theory. Bu f it is more complicated. 

Numbers, just as much as words, have the power to distort as well as 
enhance the reasoning capacity of the public. The greater the importance 
of numbers to the securing of power, the stronger the incentives to those 
in power to make certain that numbers present a favorable even if inaccurate 
picture. Across a broad front democratic politics must contend with ways 
in which numbers distort and mislead. 

At this point it is necessary to draw attention to an important subset of 
any nation's number system, what are called "performance indicators." 
Performance indicators typically serve tvo functions: they act as internal 
signals for the agency, telling it whether its goals are being achieved; and 
they serve as signals to those outside the agency, including of course those 
who set policy and control budgets. These two functions subject an agency 
to conflicting pressures. When an agency designs performance measures in 
a manner that maximizes internal information, it invites external attention 
to its failures as well as achievements. It risks sending negative signals that 
those having power over the agency can use to trim budgets or punish 
incompetence. 

It is a familiar complaint that when officials are rewarded or punished 
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according to statistical evaluations, they are drawn to policies that favor 
how the agency presents itself to the oversight process rather than policies 
that improve the conditions for which they have responsibility. The numbers 
become more important than the progress toward policy goals they pre- 
sumably index. Khrushchev (1977:71) is said to have lamented: "It has 
become the tradition to produce not beautiful chandeliers to adorn homes, 
but the heaviest chandeliers possible. This is because the heavier the chan- 
deliers produced, the more a factory gets since its output is calculated in 



Our interest is not in this well-known flaw in command economics, but 
in the implications for democratic accountability. If the number system is 
systematically manipulated so that personnel and policies are presented to 
the public in the most favorable light, we have little warrant for claiming 
that public statistics enhance democratic procedures. 

We come here to a point in the discussion where the larger analysis of 
democratic accountability intersects with a more specific argument about 
the professional accountability of those who administer the nation's statis- 
tical system. This accountability is to professional peers who evaluate, 
against the standards of their disciplines, whether government statistical 
agencies are maintaining the integrity of the numbers. Professional statis- 
ticians, in and out of government, hold that proper controls and procedures 
can protect the public from the abuses associated with fraudulent or mis- 
leading statistics. In recent congressional testimony, Courtenay Slater, an 
informed and experienced observer of national statistics, comments (1983:54): 
"One of the finest things about our statistical system is that our statistics 
have credibility. They are produced by professionals in the statistical agen- 
cies and the press release that gives us our economic data comes out of 
that statistical agency, It is written by professionals. Everybody knows it 
is objective, and they believe the numbers are honestly presented." 

There is no question that in v/ell-established statistical agencies, such as 
the Bureau of Labor Statistics and the Bureau of the Census, the production 
and reporting of statistics is managed by professionals. The norms of profes- 
sional control are deeply rooted in the origins of these agencies. Consider, 
for example, the history of the Bureau of Labor Statistics, which just 
celebrated its centennial. The bureau was established during the post-Civil 
War period of intense civil strife, and the statistics it produced were quickly 
implicated in the arguments of the day about the causes and consequences 
of industrial conflict. Labor reformers especially felt that if information 
about prevailing employment conditions could be put ' 'before the legislators 
and the public, a cry of mingled surprise, shame, and indignation will arise 
that will demand an entire change in the method of earnings and pay" 
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Although not disputing this assumption, the statisticians and social sci- 
entists, who by now had organized themselves as the American Social 
Science Association, wanted to ensure that the new labor statistics not be 
seen to favor any particular economic interest — radical, reformist, or con- 
servative. The advice given to and heeded by Carroll Wright, founder in 
1873 of the first state Bureau of Labor Statistics in Massachusetts and 
subsequently first commissioner of the Federal Bureau of Labor Statistics, 
is instructive (Walker, 1877:vii-viii). 

Your office has only to prove itself superior alike to partisan dictation and to the 
seductions of theory, in order to command the cordial support of the press and the body 
of citizens. ... I have strong hopes that you will distinctly and decisively disconnect 
the [bureau] from politics. 

This advice is no less heeded today than it was a century ago, and for 
the same reasons. Perhaps no stronger testimony to the credibility of our 
major statistical series is needed than the reliance placed on them by both 
the political process and the marketplace. A member of Congress (Horton, 
1983:2) comments, "It would be a public administration catastrophe if we 
were to And that the statistics we rely on so heavily did not adequately 
describe the real world of which we are a part and the problems we are 
trying to solve." In the marketplace substantial funds are routinely trans- 
ferred on the assumption that national statistical series are trustworthy. The 
monthly statistical reports of the Crop Reporting Board of the Department 
of Agriculture, for instance, have such high credibility that hundreds of 
thousands of dollars change hands through the commodity markets as soon 
as the data are released. 

But even if we accept that professional control over national statistics 
can largely eliminate fraud and greatly lessen bias in the most important 
of our social and economic indicators, other issues remain. The statistics 
of even the most professional agencies suffer from measurement problems 
for which there are no presently available solutions. When these problems 
lead to errors of serious magnitude and yet the numbers are used by political 
leaders to set policies and by citizens to evaluate these policies, the ac- 
countability process is compromised. 

The aptly labeled "unobserved economy" offers a telling illustration. 
Two scholars report (Alford and Feige, n.d.:14): "Recent research suggests 
that systematic biases associated with a large and growing sector of un- 
measured economic activity have been introduced into the system of social 
indicators. The unobserved sector escapes the social measurement apparatus 
because of accounting convention, nonreporting and underreporting." If 
the unobserved economy is growing more rapidly than the observed, that 
is, "counted" economy, but policy is guided by statistics only about the 
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latter, serious errors can hardly be avoided. This in turn of course distorts 
die process by which fault is assigned, and moves us away from democratic 
accountability. 

The technical and conceptual errors associated with measurement are 
serious, but for important economic and social indicators continuous profes- 
sional attention and public discussion offer safeguards. The scholarly com- 
munity here carries a major responsibility. Social scientists and professional 
statisticians have the technical skill — and career incentives — to discern 
discrepancies between what the statistics purport to measure and what they 
actually measure. 

These safeguards can operate only when die statistics are indeed public, 
that is, accessible to professional attention. Such is not the case for critical 
domains of national security policy, where secrecy prevails. Professional 
review of the adequacy and integrity of, say, unemployment or inflation 
measures is orders of magnitude more informed than professional review 
of numbers purporting to describe, for instance, the comparative weapon 
systems of the United States and the Soviet Union. 

In a telling essay, McGeorge Bundy (1984) compares statistics on U.S. 
nuclear weapons appeari g in two publications: the officially produced 
Defense Department Annual Report for the Fiscal Year 1985 and a privately 
sponsored report of the Natural Resources Defense Council, the Nuclear 
Weapons Databook. The official publication consistently underestimates 
American resources in a manner, Bundy argues, designed to make the 
"Russians look big" and the "Americans sm:*ll." This is the success 
indicator issue stood on its head, similar to when police departments inflate 
crime statistics to justify larger budgets. 

At issue is not the inevitable tension between the claims of national 
security and the right of the public to be informed, for we refer here only 
to those numbers that are presented by the government in public discussion, 
from the controversial "body counts" in the war of attrition in Vietnam to 
the equally controversial "missile counts" in the debates about the window 
of vulnerability. In sharp contrast to the care with which major statistical 
series affecting social and economic policies are professionally monitored, 
there has been little serious attention given to how independent professional 
controls can be applied to military numbers routinely advanced in open 
forums. 

A democratic society is preserved when the public has reliable ways of 
knowing whether policies are having the announced or promised effect — 
Is inflation being brought under control? Is a war of attrition being won? 
Are defense expenditures buying national security? Numbers, a part of this 
publicly available political intelligence, consequently contribute to the ac- 
countability required of a democracy. 
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Flaws in the statistics, whether inadvertently or deliberately introduced, 
mislead citizens about the performance of their government, thereby di- 
minishing accountability, but it can be plausibly argued that the wide public 
availability of reasonably accurate statistics about social conditions for 
which government is responsible enhances more than it diminishes dem- 
ocratic accountability. This conclusion, at best an informed guess, rests on 
assumptions about what is required if civic discourse is to be reasonably 
informed under the conditions of advanced industrial societies. It also rests 
on (largely untested) assumptions about the capacity of an electorate to 
make intelligent use of statistical information This last point we briefly 
return to in the concluding section, after reviewing the second of our two 
major issues connecting national statistics with democratic theory. 



As a document in democratic political theory, the Constitution's genius 
is in its provision for the representation of diverse interests in political 
decision circles. This commitment to representation iLvolved the founders 
in political engineering, one aspect of which established the close associ- 
ation between political representation and the nation's number system. In 
order that seats in the House of Representatives might be fairly allocated, 
the Constitution mandated a population count. It further directed that this 
count distinguish among the free citizens, the slave population, and the 
untaxed Indian population. This distinction arose because the founders 
wanted wealth as well as property to be reflected in apportionment — count- 
ing slaves as three-fifths of a person was a way to recognize their property 
value. Representation had to be apportioned according to politically ac- 
ceptable criteria. Moreover, the method chosen had to allow for adjustments 
as the population expanded, redistributing itself among the existing states 
or spilling over into territories that would later achieve statehood. Thus 
was established the decennial census, the centerpiece of our statistical 
system. 

The limited use of the census to apportion congressional seats did not 
satisfy James Madison. In early congressional debates Madison ( 1 790: 1077) 
urged that the census "embrace some other objects besides the f*are enu- 
meration of the inhabitants." Madison suggested that the census describe 
"the several classes into which the community is divided." On this basis, 
continued Madison, "the Legislature might proceed to maxe * proper pro- 
vision for the agrarian, commercial, and manufacturing interests, but with- 
out it they could never make L;ese provisions in due proportion." 

We know from The Federalist Papers that Madison viewed society as 
consisting of multiple and diverse interests. To govern such a society in a 
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democratic fashion required complex information about the composition of 
the public. Thus, for Madison, it was not enough that the census enumerate 
the population for the sole purpose of apportioning. It should be expanded 
to include many population characteristics, and thereby become the basis 
on which the legislators could allocate taxes, benefits, and services ac- 
cording to the "real situation of our constituents." In anticipating a de- 
mocracy in which numerical proportionality cuts much deeper than assigning 
congressional seats, Madison was ahead of his time. 

Madison's opponents started from a different theory of politics. Reflect- 
ing eighteenth century theories of die organic society, they "viewed the 
object of government as the pursuit of an undifferentiated common good; 
for them, politics was a sphere of virtue, and empirical investigation was 
irrelevant" (Starr, 1984:37). In the early days of the republic Madison' j 
opponents prevailed. Enumeration was sufficient to serve representation. 

Contemporary practice, however, is much closer to Madisonian plural- 
ism, as reflected in the vast expansion of the national statistical system and 
the policy uses to which it is put. The question before us now is how these 
developments in the statistical system affect the political representation 
process. 

Providing for the representation of diverse interests in political decision 
circles is at the core of the theoretical formulation known as democratic 
pluralism, the now dominant interpretation of American democracy. Dem- 
ocratic pluralism takes as its central problem die conditions that aliow for 
die participation by interested parties in various policy domains. Democracy 
requires that there be no barriers to the organization and expression of the 
full array of interests in society. 

Democratic pluralism is an attractive theory. Since the early days of the 
republic it has gradually gained adherents among those who have puzzled 
over the prospects for democracy in large-scale advanced industrial nations. 
But the theory also has its critics. In recent decades the effort to formulate 
a democratic theory has emphasized participation as opposed to pluralism, 
and in the process generated a critique of conventional pluralist theory. 

This critique holds that pluralism has not offered a satisfactory account 
of nonparticipation in democratic politics, too readily attributing low levels 
of participation to presumed citizen defects such as apathy or ignorance. 
Since levels of participation covary with social and economic resources, 
the critics argue, pluralism functions as a justification for the representation 
of middle and upper class interests in politics rather than a description of 
how the full array of social interests find a political voice. 

An alternative explanation of nonparticipation is suggested by E. E. 
Schattschneider's famous phrase, "mobilization of bias.*" In explaining 
why the socially and economic ily disadvantaged often fail to participate 
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in politics, Schattschneider wrote (1960:105): "Whoever decides what the 
game is about also decides who gets in the game/* This introduces the 
argument that what is on die political agenda provides a referent point that 
selectively mobilizes oarticipation across different social groups and inter- 
ests. Citizens participate not just to put issues on the political agenda but 
also, and more often, in response to the issues already there. This mobi- 
lization process, according to Schattschneider, is biased against the interests 
of the less well off groups in society. 

It is in this theoretical context that we consider how the analysis and 
political reporting of social statistics intersects the representation system. 
Although our emphasis is on contemporary politics, the practice we draw 
attention to is at least 150 years old. Starting around 1820, writes the 
historian Patricia Cohen (1982:169), "Many private agencies and volunteer 
groups with reformist agendas adopted the statistical approach to social 
facts in order to document the dimensions of the problem they were ded- 
icated to eradicating." Cohen offers several examples: the use of statistics 
to describe the miseries of public prisons; the effort by the temperance 
movement to prove quantitatively that alcohol abuse was a growing problem; 
and local surveys of pauperism as a basis to challenge poor laws. 

In deploying their privately collected statistics on behalf of social reform, 
the early nineteenth century activists anticipated developments surrounding 
publicly collected statistics that did not come fully into view for another 
half-century, when the federal Bureau of Labor Statistics was established 
in the 1880s. The 1820 reformers were signaling to later activists that 
statistics could mobilize political participation and inform public debate. 

In the latter part of die twentieth century these possibilities are etched 
much more deeply in our political life. The nation 's number system uncovers 
social conditions and popularizes them as statistical descriptions: proportion 
of die population below die poverty line; incidence of child abuse; persis- 
tence of structural unemployment; addictive behavior and its social costs; 
the differential in infant mortality between whites and nonwhites; the gap 
between male and female wages in similar occupations. The transformation 
of politically unnoticed social conditions into visible statistics puts issues 
on the political agenda that would otherwise be ignored. 

These statistical conditions then provide a political referent pou^t for 
interested groups. This is perhaps oik of the most striking aspects of twen- 
tieth century democratic politics. Resource-poor social interests turn to a 
statistical description of their plight to generate political pressure and to 
mobilize adherents to their cause. 

The history of the civil rights movement is suggestive in this regard. The 
concept of institutional racism, which held that black poverty was caused 
not just by racial prejudice but also by structural conditions of the economy, 
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polity, and society, made its political appearance through statistics on res- 
idential segregation, black-white income differentials, unequal educational 
opportunities, inequities in access to health care, and so forth. Civil rights 
leaders first used the numbers to emphasize the scope of institutional dis- 
crimination. They then used them to gain political support for new social 
policies such as Head start, job training, and affirmative action. Other groups 
have reached the conclusion that to be "measured" is to be politically 
noticed, and to be noticed is to have a claim on the nation's resources. 
Thus the physically handicapped in New York initially resisted being counted, 
for fear that this would lead to further stigmatizing them, but then reversed 
their position when they realized that political visibility closely followed 
on statistical visibility. 

Data presented in Michael Harrington's The Other America helped initiate 
the War on Poverty by identifying the poor as a target group for government 
action. The consumer protection movement, starting with Ralph Nader's 
Unsafe at Any Speed, has made heavy use of statistical arguments, as have 
the environmentalists. Describing public-interest citizen groups, one com- 
mentator (Henderson, 1981:441) writes that "the quality and quantity of 
information and the way it is structured, presented and amplified" shapes 
their political choices and strategies. 

Harold Wilensky (1967: 19) generalizes these observations when he writes 
that "facts and figures" assist those political interest organizations "weak 
in grass-roots political resources." Information "may give an advantage to 
the weak, whose case, if strong and technical, can count for something." 
This is not a trivial observation when examined in the context of the effort 
through the history of democracy to establish equal civil and political rights 
in the face of inequalities in resources that different social interests bring 
to the political arena. 

In democratic theory as well as actual practice, organization is most 
often promoted as the corrective when economic inequalities are repro- 
duced as differential opportunities for political participation. The less 
wealthy but more numerous social interests combine and increase their 
political strength through working-class parties, social movements, and 
interest groups. Consequently, a resource that helps to organize the re- 
source-poor will help to correct political imbalances and promote broader 
democratic participation. 

This observation leads us to consider whether statistical programs can 
actually help establish group identity and lead to the formation of interest 
organizations. In a careful account of the interplay between ethnicity and 
the census, William Petersen (1983:27) writes: "Few things facilitate a 
category's coalescence into a group so readily as its designation by an 
official body," and cites the importance of "questions put to them by 
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immigration officials and census schedules" for helping to solidify group 
identification. 

Hispanic-Americans are particularly important in this regard. More than 
any group in American political history, Hispanic- Americans have turned 
to the national statistical system as an instrument for advancing their political 
and economic interests, by making visible the magnitude of the social and 
economic problems they face. 

In the processes by which groups are formed and diverse interests are 
represented in democratic politics, public statistics are not an unmixed 
blessing. Just as some groups can establish a political identity by being 
enumerated, other groups cannot escape the way they are socially clas- 
sified because of this same enumeration system. For example, for two 
centuries we have had a statistical practice of racial classification , which 
undoubtedly has contributed to the continuing salience of race in Amer- 
ican society. Policies now being implemented could easily result in the 
Hispanic-Americans becoming a permanent racial minority in the statis- 
tical system, with what long-term effects it is difficult to foresee. More- 
over, the statistical system is not sufficiently robust to withstand the 
distortions accompanying severe political pressures. When political cri- 
teria are transparently used to determine what should be technical issues, 
such as the best way to count a population group, statistics lose their 
credibility. 

Racially sensitive measurement policies are not likely to be reversed 
soon, now that so many government services are allocated according to 
race and ethnicity. The brief period during which it was thought wrong to 
identify race, gender, or national origins on employment or school appli- 
cations was swept away by the emergence of affirmative action and statis- 
tical parity in the 1970s. The nation has entered a period in which 
"proportionate allocation" is carried to ever greater extremes. There is a 
contagion effect: Once statistical proportionality is elevated to a principle 
of government, there is great pressure from various racial and ethnic groups 
to be fully counted. 

From the perspective of democratic theory these developments are trou- 
bling in at least three respects. First, to assign to the statistical system 
responsibility for group classification and resource allocation is to transform 
the thing being measured — segregation, hunger, poverty — into its statistical 
indicator. Always in tension with the judgmental in politics is an insistent 
search for objective rules to reduce the element of arbitrariness in subjective 
judgment. The legal code is one such set of objective rules, formalized 
bureaucratic procedures another, and now statistical formulas. This search 
does not eliminate politics; it simply pushes them back one step, to disputes 
about methods. Arguments about numerical quotas, availability pools, and 
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demographic imbalance become a substitute for democratic discussion of 
the principles of equity and justice. 

Second, if statistical identification facilitates political consciousness among 
some resource-poor groups , these same statistics make invisible to the policy 
process other groups at the margins of social and economic life, where 
measurement often fails — the undocumented workers, the illegal aliens, 
and the vagrant, homeless populations. In many government programs, 
persons not counted are not there. Another difficulty stems from the inertia 
of statistical systems. For technical as well as bureaucratic reasons, statistics 
lag behind the dynamic patterns of group formation and change resulting 
from immigration, internal migration, transformation in the occupational 
structure, and new levels of social consciousness. Insofar as politics is 
organized by the numbers, there will be a tendency to overlook more 
recently established social conditions in favor of those already reflected 
through the statistical system. 

The third and most troubling danger is the shift away from a system of 
representation and public policy based on the individual citizen toward one 
based on the representation of demographic aggregates: ethnic, racial, in- 
come, gender, etc. This shift invites, even mandates, the allocation of 
benefits and rights according to group membership rather than individual 
accomplishment or need. 

To many observers this tilt toward group representation undermines the 
fundamental premise of liberal democracy. Nathan Glazer (1975:220) la- 
ments the drift toward numbering and dividing up the population into racial 
and ethnic groups: 4 'This has meant that we abandon the first principle of 
liberal society, that the individual and individual's interests and good and 
welfare are the test of a good society, for we now attach benefits and 
penalties to individuals simply on the basis of their race, color, and national 
origin." Glazer, of course, does not attribute the rise of quota politics and 
group-based representation to the availability of statistical information. But 
if statistical information has not caused, certainly it has abetted the emer- 
gence of demographically defined groups as a category in public policy. 

The formal system of political representation itself has not escaped the 
insistent pressure for demographically defined proportionality. Abigail 
Thernstrom (1983) artfully traces how the 1965 Voting Rights Act was 
transformed in two decades from a law to protect black voting rights to 
one that appears to require the "correct" number of minority seats in 
legislative bodies. Demands for proportional representation, in which the 
legislature is to mirror the characteristics of the population from which it 
is selected, are not new. Until recently, however, group politics intersecting 
with the electoral process was the preferred avenue for achieving this end. 
Legal remedies were, appropriately, limited to ensuring fair procedures, 
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not particular outcomes. Now, buttressed by statistics, laws have begun to 
affect the very composition of legislative bodies. 

As was the case in our discussion of accountability, we see in this 
discussion of representation that countertendencies are at work. On the one 
hand, statistical description can bring social conditions to public attention, 
mobilize disadvantaged groups, and broaden the political agenda in ways 
that lessen the bias inherent in an electoral representation system based 
largely on the resources of wealth and political organization. On the other 
hand, these statistics introduce practices and policies inconsistent with our 
traditional understanding of democracy: the objectification of politics; the 
assumption that that which is not counted is not there; the temptation to 
substitute group membership for individual merit or need as the basis for 
public policy; the allocation of legislative seats according to designated 
racial or ethnic criteria. 

We are far from having the evidence that would allow us to sort out the 
relative strength of these countertendencies and again must resort to an 
informed guess. With respect to democratic accountability I suggested that 
the benefits of statistical descriptions outweighed the harms. With respect 
to the representation of diverse interests I am less sanguine. The distortions 
of the representational process seem to me every bit as strong as the im- 
provements. Moreover, the negative tendencies are not of the sort that can 
be corrected with greater professional scrutiny of statistical information. 
They are much more political than technical in nature and in fact become 
stronger as statistics become more precise and reliable. 



CONCLUSIONS 

I conclude by emphasizing the theme that connects this essay with the 
efforts of Ogburn and colleagues. The present inquiry has emphasized the 
importance of close attention to the nation's number system by professional 
statisticians and social scientists. Assuring the integrity of numbers involves 
continuous improvements in measurement, revisions in concepts as social 
conditions change, and the highest standards of statistical interpretation, 
analysis, and reporting. Moreover, protecting statistical quality and integrity 
will add little to democracy unless joined to the educational task of ensuring 
that numeracy takes its place alongside literacy as a skill indispensable to 
democratic citizenship. In the absence of public understanding of statistical 
argumentation, the numbers will more likely aid political demagoguery than 
democratic discourse. 

Because of, and notwithstanding, the various problems and risks iden- 
tified in this essay, those who care about democracy have a large task before 
them: analysis of the political role of numbers, as well as a commitment 




J" 



P£/BL/C STATISTICS AND DEMOCRATIC POLITICS 111 

to making the numbers perform accc 'ding to the responsibilities that a 
democracy places upon them. 
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INTRODUCTION 



Social policy and research on the deterrence of crime have often been 
unrelated in the United States. While politicians have periodically called 
for harsher punishments to deter crime, most criminologists prior to the 
1970s either ignored the deterrence issue or voiced strong skepticism toward 
it (e.g., Sutherland, 1924:360; Reckless, 1967:504). This gap between 
policy and research is unfortunate, manifesting itself in policy initiatives 
unrefined by empirical evaluation and empirical research with little policy 
significance. 

In recent years the estrangement between criminology and social policy 
on deterrence has shown signs of abating. This chapter examines recent 
social research on two important categories of modem human misconduct — 
street crime and drunk driving — to explore the implications of recent crim- 
inological studies for policy on the deterrence of crime. 

TWO FUNDAMENTAL PERSPECTIVES ON CRIMINAL CONDUCT 

Two broad perspectives on human behavior have long competed for 
preeminence in efforts to control crime in America. Both have historical 
roots as well as present-day champions, and both have been evident in the 
operation of our legal system since its inception. The first asserts that human 
behavior may be usefully represented as the product of rational individual 
calculation; the second asserts that behavior is largely guided by nonrational 
biological, psychological, or social forces. 
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The "rational actor" perspective assumes that human beings behave to 
maximize personal pleasure and minimize pain. As elaborated by social 
reformers like Bentham and Beccaria, or jurists like Blackstone, Romilly, 
or Feuerbach, this view argues that crime can be deterred by increasing the 
costs of criminal behavior or increasing the rewards of noncriminal behav- 
ior. Contemporary criminologists generally refer to this perspective as the 
"Classical School" (Jeffery, 1972; Void, 1979). 

By contrast, the second perspective assumes that human behavior, in- 
cluding crime, is governed by forces over which the individual has relatively 
little conscious, rational awareness or control. Starting with Cesare Lom- 
broso and his students in the 1800s, criminologists have labored to discover, 
describe, and understand these forces. Contemporary criminologists refer 
to this perspective as the "Positive School." 

For those who assume that crime is caused by factors outside the of- 
fender's control, the proper role of criminology is not to investigate the 
deterrent effects of variations in the law and its enforcement but, rather, 
to help ameliorate the problem of crime by identifying and taking steps to 
alter the biological, psychological, or social conditions that produce it. 
Whereas the classical perspective suggests the possibility of deterring crime 
through manipulating actual or expected rewards and punishments, the 
positive school recommends changing nonrational elements of the offender's 
psyche or environment. The most common policy approaches for bringing 
about these changes include a variety of intervention strategies that re- 
gardless of their actual performance, have generally been justified as "re- 
habilitative" (e.g., probation, parole, indeterminate sentencing, and 
institutional treatment). 

Public policy on crime in the twentieth century has drawn from both the 
positive and classical schools. The belief that credible threats of punishment 
deter criminal behavior is probably as old as criminal law itself and has broad 
appeal to policymakers and the public. From an intuitive point of view it 
seems reasonable. Surely Chinese citizens were less likely to exceed the speed 
limit in Peking early in this century when authorities exhibited the heads of 
drivers executed for speeding alongside speed limit signs (Zimring and Hawk- 
ins, 1973:1 1). The adoption of harsh laws against crime in the United States, 
as well as the mobilization of criminal penalties to deal with specific behavioral 
problems — such as drunkenness and other drug abuse — show continued faith 
in the efficacy of deterrence. At die same time, twentieth century policymakers 
have created vast programs to rehabilitate criminals, including probation, pa- 
role, and specialized correctional facilities for juveniles and for specific cat- 
egories of convicted offenders. While some of these efforts at rehabilitation 
have been called half-hearted, no one can seriously deny that substantial 
resources have been devoted to the rehabilitative ideal. 



139 



DETERRENCE IN CRIMINOLOGY AND SOCIAL POUCY 



In contrast to the dual approach of policymakers, balancing (perhaps 
vacillating) between deterrence and rehabilitation, criminologists have by 
and large rejected deterrence principles. For over a century a host of dis- 
tinguished scholars with viewpoints as different as Enrico Fermi and Edwin 
Sutherland were able to agree on one point: deterrence does not work. 1 One 
major reason for this long-term rejection of deterrence is that criminologists, 
especially in the United States, traditionally were humanists and reformers 
(Gibbons, 1979; Wilson, 1983a). Since the early years of this century 
American criminology has had a strong social reform component, based 
on the belief that government is not merely a device to facilitate the pursuit 
by individuals of their private ends, but also a device to shape and improve 
the character of its citizenry. The reformers held that if only the right 
institutions were built, the right people properly trained to staff them, and 
the right classification procedures used to fill them, then surely rehabilitation 
would occur. 

However, in the mid-1970s a profound retreat from these assumptions 
became evident among criminologists and -olicymakers. For the first time 
in 150 years American criminologists seriously questioned whether reha- 
bilitation was a reasonable goal. State governments across the United States 
were moving away from indeterminate sentencing (long associated with the 
rehabilitative ideal), curtailing the use of probation and parole, and speaking 
against * directional" programs slanted toward rehabilitation. 

The reasons for the recent decline in the popularity of rehabilitation, both 
among policymakers and criminologists, undoubtedly warrant a separate, 
detailed account. Here we can merely summarize the more common ex- 
planations for the shift. First, renewed interest in deterrence a; pears to 
reflect the perceived failure of rehabilitation policies. This failure was 
typically "proven" by the precipitous increase in crirro rates in the last 
two decades (Wilson, 1975, 1983a, 1983b); by widely publicized prison 
disasters, such as those in Attica in 1969, and Santa Fe in 1980; and by 
mounting research evidence that many programs aimed at preventing re- 
cidivism thro jigh rehabilitation programs have been relatively ineffective 
(Martinson, 1974; Upton et al., 1975). 

Second, traditional methods of rehabilitation have come under increas- 
ingly strong attack in the last two decades on the basis of 'heir intrusiveness. 
In a series of decisions in the 1960s, most notably Escobedo Illinois 
(378 U.S. 478, 1964) and in re Gault (387 U.S. 1, 1967), the Supreme 
Court revolutionized the meaning of due process rights in American law 



Jn fact, this bclict was fully articulated by Edwin Suthcrlan 1 in the chapter he wrote on "Crime 
and Punishment" for Recent Social Trends in the United States, the Cgbum Report. 
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with important consequence^ *or the idea of rehabilitation, with its emphasis 
on individualized intervention to bring about psychological changes in of- 
fenders. Disdain among both politicians and criminologists for this type of 
intervention became increasingly evident in the late 1960s. For example, 
Struggle for Justice (1971), the influential report prepared for the American 
Friends Service Committee, asserted (p. 85) that rehabilitation rests "largely 
on speculation or on assumptions unrelated to criminality," and that de- 
cisions made about offenders are routinely made 4 4 in the absence of credible 
scientific data on the causation or treatment of crime." This critique and 
others like it attacked the fundamental assumptions of rehabilitation: that 
crime is caused by forces over which the individual has little control and 
that the criminal justice system is able to identify and correct these forces. 

Finally, a less theoretical but eminently plausible explanation for the 
decline in support for rehabilitation programs is their cost. Rehabilitation, 
especially the kind envisioned by much of the criminological literature, is 
expensive. Many states find themselves spending increasing amounts of 
scarce revenues on correctional programs that are often difficult to justify 
to taxpayers. Andrew Scull (1977) and others have argued that these purely 
economic forces, rather than humanitarian or scientific concerns, have led 
to declining support for rehabilitation. 

With diminishing support for rehabilitation, the justification for punishing 
criminals has shifted toward deterrence, and the forms of scholarship have 
likewise changed. It is difficult to find more than a half-dozen professional 
articles or books written on the subject of deterrence from 1900 to 1965. 
But starting in the 1960s, the study of deterrence has become a crimino- 
logical growth industry. Within criminology, the deterrence proposition has 
generated new interest and a large and rapidly expanding research literature 
(see Zimring and Hawkins, 1973; Andenaes, 1974; Gibbs, 1975; Cook, 
1977;Blumsteinetal., 1978; Tittle, 1980; Archer etal., 1983; for reviews). 
Stated simply, this proposition asserts that proscribed behavior is deterred 
by perceptions that legal punishments are swift, sure, and severe. 

Our main purpose in this chapter is to appraise and interpret research on 
the deterrence proposition. Berrusc researchers know little about the con- 
sequences of swifr punishment in the context of laws — there is too little 
swift punishment l mailable in our legal system to study — our exclusive 
focus is on variations in the certainty and severity of punishment. Moreover, 
evaluations of the effects of certainty and severity of punishment must 
distinguish between the legal existence (de jure) of punishment, and its 
actual use (de facto). Despite de jure changes in punishment, meaningful 
de facto changes have been rare. Thus, evaluations of the deterrence prop- 
osition are confounded by the fact that real changes in the imposition of 
punishment seldom accompany legal changes. We review prior research 
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for two important types of criminal behavior, street crime and drunk driving, 
present generalizations that have been established in these fields, evaluate 
the strength of the evidence, and interpret their meaning for social policy. 

STREET CRIMES 

Perhaps no area of deterrence research has generated as much public 
interest in recent years as attempts to reduce street crime, which generally 
means robbery, rape, assault, or murder that occurs in public places between 
people previously unacquainted. The public is concerned and fearful: the 
President's Commission on Law Enforcement (1967) found that one-third 
of all Americans were afraid to walk alone at night in their own neigh- 
borhoods, and many reported that they stayed off the streets altogether 
because of their fear of crime. Subsequent victimization surveys conducted 
by the Department of Justice (see, e.g., Hindelang, 1976) have confirmed 
the enormous impact that fear of street crime has on the behavior of citi- 
zens — especially the poor, members of minority groups, and urban resi- 
dents. "Crime in the streets" has been a recurring national and political 
issue since the mid-1960s, and strategies for dealing with street crime are 
the subject of debates, media programming, political campaigns, and gov- 
ernment commissions. In light of this interest social science knowledge 
concerning the effect of deterrence-based legal interventions bears important 
policy implications. This knowledge is summarized here. 

Certainty of Punishment 

The deterrence proposition predicts that proscribed behavior will be re- 
duced to the extent that the relevant public perceives a high likelihood of 
punishment for violations. 2 As other reviewers (e.g. , Blumstein et al. , 1978) 
have noted, relatively little effort has been made to measure this perception 
directly; the bulk of what we know simply relates aggregate measures of 
street crime to policy or program innovations that are intended to increase 
the actual chance of punishment, implicitly assuming that increases in the 
(intended or actual) likelihood of punishment will in turn lead to increases 
in its perceived certainty. This chain of assumptions may reasonably hold 
where the innovations are accomp* lied by official publicity and mass media 



2 Deterrence research distinguishes between ••general** and ••special" types. General deterrence 
is the inhibiting effect of sanctioning an offender on other potential offenders* criminal behavior. 
Special deterrence is the inhibiting effect of sancdoning an offender on his or her own future 
criminal behavior. 
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attention, but its unproved assertion is a point of weakness in much of the 
existing evidence. 

The best-studied legal innovation to increase the certainty of punishment 
has been intensive policing efforts designed to raise the risk of apprehension 
and charging. 

Police crackdowns are fairly common responses to public concern about 
street crimes, and a number of these have been submitted to evaluation. 
Of particular interest are two studies of patrol efforts in New York City, 
"Operation 25" and the "20th Precinct" studies; the San Diego Field 
Interrogation project; a study of robberies in the New York City transit 
system; the LEAA High-Impact Anti-Crime project; and the Kansas City 
Preventive Patrol project. 

One of the earliest evaluations of increased patrol's effect in reducing 
street crimes was Operation 25 in New York City (see Zimring and Hawkins, 
1973:348-349). The police department selected the twenty-fifth precinct, 
a small district with a high crime rate, for greatly increased patrol during 
a four-month experimental period. The number of foot-patrol officers within 
this district was quadrupled for the experiment, and crime rates declined 
in all categories during those four months. However, no data were available 
to investigate the possibility that Operation 25 may have shifted the location 
of crimes from the experimental precinct to adjacent areas. 

This weakness in Operation 25 was avoided in a subsequent, similar 
study in the twentieth precinct, which received a 40 percent increase in 
police manpower and also noted decreases in the rates of major crimes 
(Press, 1971). The evaluation controlled the experimental data with findings 
from adjacent districts to test for displacement effects and from distant 
districts to test for the possibility that a general decline in crime rates could 
have explained the decline observed in the experimental district. The control 
data supported the conclusion that the patrol was effective in reducing street 
crimes (i.e. , those visible from the street) and that it did not merely displace 
the criminal activity into adjacent districts. Of course, these results can also 
be questioned. The period of die experiment was only eight months and 
only changes in crimes reported to the police were measured. Moreover, 
the official records were maintained by police who were aware of the 
experiment and/or the previous findings from Operation 25. 

One of the best-designed experiments on the deterrence of street crime, 
the San Diego project (Boydstun, 1975), concerned the practice of stopping, 
questioning, and frisking persons who aroused police suspicions (i.e., con- 
ducting "field interrogations"). In one area of the city, field interrogations 
were eliminated, whereupon the number of 4 'suppressive" crimes (robbery, 
burglary, theft, auto theft, assault, sex crimes, malicious mischief, and 
disturbances) increased by about a third; when field interrogations were 
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resumed, the number of such crimes dropped back to preexperimental levels. 
There was no change in the frequency of suppressive crimes in two control 
areas where either field interrogation practices remained unchanged or po- 
lice officers woe specially trained to conduct them in light of legal pro- 
cedures and human relations principles. Because the presence or absence 
of field interrogations did not affect the number of arrests in either control 
or experimental areas, Boydstun concludes that the visibility of police 
activity was responsible for the apparent deterrence of crime. 

In response to a large increase in subway robberies (especially of toll- 
booth stations) in 1965, die New York Transit Authority introduced special 
patrols on the subways during nighttime hours. E valuators (Chaiken et al. , 
1974) found that crime rates dropped substantially during die patrol hours, 
but not during the balance of the day, for up to six years following die 
crackdown. An interesting sidelight in the study that has important impli- 
cations for deterrence research was the discovery of a "phantom effect." 
For eight months, while there were stepped-up patrols only at specific times 
and places, serious crime rates declined throughout the subway system. 
The evaluators assert that uncertainty as to the deployment of die police 
had a deterrent effect on potential offenders in nontarget areas and times. 
However, in die long run this phantom effect disappeared, possibly because 
potential felons became familiar with actual deployment practices. One 
difficulty with this study was that the evaluation was based on crime reports 
made by the participating transit police officers. After the evaluation was 
completed, researchers alleged that police officials miscoded the times of 
some offenses, apparently to exaggerate the reduction in frequency of of- 
fenses during peak patrol hours (Gallagher, 1978:176). In a reexamination 
of the evidence, Chaiken (1976) concludes that despite die falsification, 
there wis a significant deterrent effect, although of lesser magnitude than 
die original evaluation suggested. 

Perhaps die most ambitious experiment in deterring street crime ever 
attempted in die United States was the High-Impact Anti-Crime Program, 
funded for $160 million by the Law Enforcement Assistance Administration 
in 1972 (Chelimsky, 1976). Eight cities with high crime rates were targeted 
for crime-reduction programs with the goal of reducing stranger-to-stranger 
personal crime by 5 percent in two years and 20 percent in five years. Each 
city had complete discretion in designing individual programs and evalu- 
ating the consequences. Most of the money was allocated to increased 
enforcement, although some projects also aimed at streamlining court op- 
erations. Unfortunately, the variability in programs and evaluations strongly 
compromised the scientific utility of the program. A summary evaluation 
(Chelimsky, 1976) found the individual project descriptions to be flawed, 
but the summary was forced to rely on Uniform Crime Report (UCR) data 
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that omitted several theoretically crucial variables, including conviction 
rates, sentencing patterns, and arrest-to-crime rates. As Franklin Zimring 
points out (1978:144-149), the UCR data do not allow specific measures 
of stranger-to-stranger offenses, those targeted for reduction by the project. 
Even so, the final evaluation offered no explicit statistical comparison for 
crimes other than burglary. The report found that five of the eight Impact 
cities had 1974 levels of burglary lower than would have been predicted 
on the basis of extrapolations from a set of comparison cities, but no such 
differences were apparent in the remaining three cities. Thus, modest sup- 
port was obtained for the deterrence proposition from this very costly ex- 
periment. 

The Kansas City Preventive Patrol project was designed to test the relative 
effect on crime rates of three policing strategies: "proactive" patrol, with 
patrol car levels between two and three times the normal level; "normal" 
patrol; and no routine patrol, police entering the area only in response to 
calls for assistance (Kelling and Pate, 1974). These strategies were assigned 
randomly to IS contiguous districts within the city and were maintained 
for 12 months. The major dependent variables were official police statistics 
and victim reports. No deterrent results could be found for patrol at either 
"normal" or "proactive" levels. 

Critics of the Kansas City study (cited in Zimring, 1978:142-143) have 
pointed out that the districts were small, and that residents not subjected 
to patrol could still see police patrolling peripheral areas and responding 
to calls. There were, in fact, no significant differences in police response 
time or arrest rates between the three kinds of patrol districts. The com- 
parison between no-patrol and routine patrol areas might have been con- 
taminated by this proximity effect. It is notable that the Kansas City patrols 
were by car whereas prior studies reporting positive effects used foot patrols. 
However, the study was widely interpreted as failing to show that doubling 
or tripling of police patrol could measurably affect crime rates. 



The deterrence proposition also predicts that proscribed behavior will be 
reduced to the extent that the relevant public perceives great severity of 
punishment for violations. Most research on this hypothesis compares ju- 
risdictions that have death penalty provisions with those having (presumably 
less severe) prison sentences; compares jurisdictions with longer and shorter 
prescribed or actual prison sentences for various offenses; or examines 
longitudinal effects of changes in punishment severity on official crime 
rates (for reviews, see Andenaes, 1974; Zeisel, 1976; Blumstein et al., 
1978). Regardless of methods used, there is little evidence directly bearing 



Severity of Punishment 





DETERRENCE IN CRIMINOLOGY AND SOCIAL POLICY 



137 



on perceptions of severity; the tacit assumption is that perceptions of severity 
generally accord with the severity of formal legal prescriptions. This as- 
sumption may be false in specific circumstances. 

The Death Penalty Capital punishment obviously cannot be experi- 
mentally manipulated, and examinations of its deterrent effect have been 
limited to comparisons of homicide rates in contiguous states with and 
without die death penalty (Campion, 1955; Sellin, 1959); to examinations 
of time-series data on homicide rates within one or more jurisdictions that 
change capital punishment laws (Sellin, 1959; Walker, 1969); and to com- 
parisons of homicide rates within a jurisdiction before and after the im- 
position of a death sentence or execution (Graves, 1956; Savitz, 1958). 
Although these studies have generally failed to find evidence for a deterrent 
effect of capital punishment, they have serious methodological problems 
that compromise their probative value. The main problems lie in their 
inability to control for demographic, cultural, and socioeconomic factors 
other than the death penalty that could affect rates of serious criminality, 
and their failure to distinguish between the formal prescription of the death 
penalty and its actual application. 

Recent support for the deterrence proposition in the matter of capital 
punishment has been reported in a well-known study by Isaac Ehrlich 
(1975), who examined aggregate U.S. data on homicide rates and capital 
punishment for the yeers 1932-1970. After performing an elaborate set of 
statistical analyses, Ehrlich concluded that capital punishment does deter 
homicide, offering a specific estimate of the magnitude of the effect (p. 398): 
"On the average the tradeoff between the execution of an offender and the 
lives of potential victims it might have saved was of the order of magnitude 
of 1 for 8 for the period 1933-1967 in the United States." 

Ehrlich's study, introduced to the Supreme Court by the Solicitor General 
in Fowler v. North Carolina (95 Sup. Court 223, 1975), has been widely 
cited in support of capital punishment legislation and its imposition in 
individual cases. Because of the importance of the findings and the fact 
that it is one of the few studies to report a deterrent effect for capital 
punishment, its supporting data have been reanalyzed several times (e.g., 
Bowers and Pierce, 1975; Klein et al., 1978). These analyses show that 
Ehrlich's findings are sensitive to minor changes m the form of the analysis. 
Among the most striking is the consequence of changing the time period 
over which the analysis is made: the negative relationship between homicide 
rates and executions is present only when the years 1962-1969 are included 
in the analysis, and these were unusual years for the United States in that 
both homicide and all other street crimes increased dramatically while the 
frequency of executions declined steeply (and ceased entirely in 1968; see 
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Bowers and Pierce, 1975:197-202). 3 Thus, the probative value of the Ehr- 
lich study is doubtful. 

The bulk of the existing literature on whether capital punishment deters 
crime more than other forms of punishment (e.g., life imprisonment) is 
based on data gathered in situations in which few convicted offenders 
actually receive capital punishment (and the method for selecting which 
offenders receive it appears highly capricious despite recent Supreme Court 
decisions aimed at clarifying standards), so the generally negative findings 
must be understood as limited to situations in which actual likelihood of 
punishment is low. It is possible that if executions were applied with higher 
likelihood they might have an effect on homicide rates, but the wisdom of 
such a policy is primarily a moral and ethical matter, not a scientific 
question. In any event, policy decisions about capital punishment are more 
likely to be affected by moral and ethical considerations than by the crude 
estimates of deterrent effects that present social research lias produced. 

Other Punishments The quantity of studies of the deterrent effects of 
noncapital punishments on street crime is somewhat more impressive (for 
a review, see Nagin, 1978). However, there is jnly one study we know of 
where the design rises to the level of a quasi experiment. Schwartz (1968) 
studied die effect of increased statutory penalties for rape and attempted 
rape on the frequency of these crimes in Philadelphia. Following a brutal 
rape case that received a great deal of media attention, the state of Penn- 
sylvania raised the maximum penalty for rape by a factor of two or three, 
depending on the severity of the case. Schwartz examined reported rape 
rates for the period surrounding the change and concluded that neither the 
frequency nor the seriousness of rape changed significantly after the new 
law was passed. The study has obvious weaknesses, most notably the use 
of reported cases of a crime that is notoriously underreported. However, it 
avoids the general problem that affects the balance of the known studies: 
the impossibility of adequately controlling statistically for all the important 
social, economic, and demographic variables other th?n deterrence-based 
laws that can affect crime rates. 

The general picture painted by the bulk of studies relating severity of 
punishment for street crimes as measured by length of sentences (prescribed 
or actual) to crime rates is less favorable to the deterrence proposition 
(Chiricos and Waldo, 1970; Forst, 1976). The predominant finding is of 
no significant relationship. In an exhaustive review of the evidence for a 



3 A detailed methodological critique of Ehriich's analysis by Klein et a!. (1978:343-351) cites 
several other reasons for questioning his results, including omitted variables. 
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deterrent effect of sentence severity on crime, Nagin (1978:110) concludes 
that at best the results are "equivocal." As with the death penalty studies, 
though perhaps to a lesser degree, these studies also take place against the 
background of a relatively low risk of any punishment. Hence, results more 
favorable to the deterrence proposition might be found if severe punishments 
were perceived to be likely in the event of violations by the population to 
which the threat is addressed. 

Summary 

On the matter of certainty of punishment, we find some support for the 
deterrence proposition in the literature on street crime. With few exceptions, 
crime rates are found to decline when measures are adopted to increase the 
certainty of punishment. However, there is much weaker confirmation for 
the deterrence proposition in the matter of severity of punishment, with a 
few studies claiming an effect contradicted by numerous studies finding no 
effect. Because all empirical research in the area of severity of penalties 
takes place in situation where the objective likelihood of punishment is 
very low, scientific generalizations and policy decisions based on this lit- 
erature should be appropriately qualified. 

DRUNK DRIVING 

Unlike the case for street crimes, active public interest in the isoie of 
drunk driving is relatively recent, and its depth and persistence have not 
yet been tested. The acute current concern about drunk driving has resulted 
in a flood of deterrence-based legal interventions that promise to increase 
our understanding of deterrence both in the specific case and in general. 
Laws have been passed and enforcement campaigns undertaken with the 
aim of increasing the perceived certainty and severity of punishments for 
drunk driving, forming a pool of natural experiments that can be subjected 
to analysis. Furthermore, the results can be ascertained using indexes such 
as weekend night fatalities, which are often both validly and reliably mea- 
sured by official statistics agencies, and which correlate strongly with al- 
cohol-impaired driving. There now exists a relatively large body of knowledge 
in this area and additional experience is rapidly accumulating. 

Certainty of Punishment 

Two types of legal interventions regarding drunk driving are directed 
primarily at increasing the objective (and hence, presumably perceived) 
certainty of punishment. The first of these is the replacement of laws that 
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define the offense in behavioral terms with "Scandinavian-type" or "per 
se" laws, which define the offense in terms of blood-alcohol concentrations, 
measurable by instruments. Typically these laws require only that a driver 
exceed the tolerated level in order to justify arrest and to secure conviction; 
there is no need to demonstrate drunken behavior or to produce other 
evidence of impairment, a matter that under previous law was a severe 
handicap in detecting and prosecuting drivers whose chances of experienc- 
ing a crash were substantially increased by the consumption of alcohol. 
The second type of intervention is the enforcement crackdown, in which 
police resources devoted to drunk-driving patrols are abruptly increased. 
We will review the literature accumulated to date on the deterrent effects 
of these interventions. 

Scandinavian-type Laws These laws originated in Norway in 1936 and 
Sweden in 1941, where they formed part of general accumulations of legal 
restrictions on drunk driving and on overall alcohol use. In the original 
countries these laws were not very much noticed, being relatively small 
incremental steps in the accretion of policy. However, over the years, the 
Scandinavian countries won a reputation (not completely earned, in our 
opinion) for having dealt successfully with drunk driving, and the laws 
were copied in other jurisdictions where they represented a sharp break 
with tradition and therefore were much more noticeable. The first important 
adoption of this model outside Scandinavia came in Great Britain in 1967. 
The Road Safety Act of that year prohibited driving or attempting to drive 
with a blood-alcohol concentration greater than .08 percent (the level that 
a man of medium build might reach after drinking four or five drinks on 
an empty stomach in the period of an hour). It was proposed that police 
be empowered to stop any driver and administer a screening breath test for 
Wood alcohol (using a new device imported in mass quantities from West 
Germany, with much fanfare). Although this provision was rejected by 
Parliament the final legislation permitted a test on anyone involved in an 
accident (regardless of fault) or committing a serious traffic law violation. 

In addition to receiving official publicity, the British Road Safety Act 
was strengthened by media attention that continued for years as the complex 
law was challenged on numerous grounds by defendants seeking to escape 
the mandatory penalty of a year's license suspension. Evaluations of the 
law initially showed substantial deterrent effects: weekend night fatalities 
and serious injuries declined by more than half immediately following the 
imposition of the new rule, and attribution of the decline to the law was 
supported by the failure of comparable casualties to decline during non- 
drinking hours, as well as by behavioral data reported in polls and other 
sources (Ross, 1973; Saunders, 1975). However, the effect of the law was 
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despite the initial negative results reported for their Scandinavian-type laws, 
campaigns of enforcement for these laws were accompanied by impressive 
declines in casualties in the affected jurisdictions. 

Among the most ambitious enforcement efforts were the U.S. Alcohol 
Safety Action Projects (ASAPs) funded by the United States Department 
of Transportation in 35 sites in the mid-1970s. It is estimated that more 
than $200 million in public funds was spent on these projects, which cen- 
tered on increasing patrol as well as on streamlining the processing of the 
accused in the criminal justice system. Unfortunately, very much like the 
High-Impact Anti-Crime program for street crimes, the structuring of the 
ASAPs varied by city, and the local evaluations were on the whole incom- 
petent. However, a final evaluation by competent if perhaps not disinterested 
U.S. Department of Transportation staff (1979) did find evidence for a 
deterrent effect, as measured by greater reductions in nighttime than in 
daytime fatal crashes, in 12 of the 25 sites, and in 8 of the 13 sites where 
the absolute level of nighttime crashes and a moderate population growth 
rate rendered evaluation less problematic. 

Severity of Punishment 

Although efforts to increase the severity of punishment ' drunk drivers 
have probably been much more frequent than those directed at certainty, 
there are few published evaluations. The efforts have taken the form of 
increasing statutory penalties and of judicial crackdowns increasing the 
actual penalties for drunk driving in various jurisdictions. 

Statutory Changes Many statutory changes in the penalty for drunk driv- 
ing have been accomplished as part of broad packages of countermeasures, 
some of whi^h also relate to increasing certainty. One example of a law that 
appears to have been directed only toward increasing the perceived severity 
of penalties took p.ace in Finland in 1950, when the maximum sentence for 
dmnk driving was doubled from two to four years, with the provision for six 
years in the event of serious IxxPy injury resulting from the offense, and 
seven years for causing death. Although there was a decline in crash-related 
fatalities in subsequent years, it proved not to be possible to attribute this 
decline to the law because it was greater for less serious crashes than for fatal 
crashes, whereas the latter are more likely to involve drunk driving. Further- 
more, the drop was greater for multiple-vehicle crashes than for single-vehicle 
crashes, the latter again more likely to involve alcohol (Ross, 1975). 

Judicial Crackdowns In Chicago in 1970 the supervising judge of the 
traffic court decreed that aJl defendants judged guilty of drunk driving during 
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the Christmas holidays should receive a seven-day jail sentence. Although 
subsequently the number of crashes declined, and officials made broad 
claims of success for this campaign, careful analysis of the data found that 
the decline could not be distinguished from chance variation. Furthermore, 
data from Milwaukee, chosen as a comparison jurisdiction, showed an even 
greater proportional decline, although as in Chicago it was not statistically 
significant (Robertson et al., 1973). 

Similar findings are reported from a city in New South Wales, Australia, 
where a local magistrate declared his intention to increase greatly the pen- 
alties for drunk drivers. Research showed that serious crashes did not decline 
perceptibly even though, unlike in Chicago, the threatened penalties were 
actually put into effect in most cases (Misner and Ward, 1975). 

There is evidence in these studies that the criminal justice system reacts 
in a way that vitiates the declared severity of actual punishments. For 
example, in Chicago, it was found thai convictions declined where drivere 
were accused in the absence of chemical tests in evidence. In a more recent 
study of jurisdictions adopting mandatory jail sentences for first offender 
DUI (driving under the influence) defendants, Gropper et al. (1983) report 
that there were important increases in not-guilty pleas, in jury trials, and 
in failures to appear for trial as well as dismissals and not-guilty findings. 
Moreover, the eventual punishment for those nonetheless convicted was 
slowed by considerable increases in delay between arrest and conviction. 

Summary 

The research to date on attempts to deter drunk drivere suggests that 
measures directed at increasing the perceived certainty of punishment can 
have a sharp, immediate deterrent effect on the proscribed behavior. Rates 
of crashes likely to involve alcohol decline sharply at the inception of well- 
publicized laws that simplify apprehension and prosecution, and during 
well-publicized campaigns of police enforcement. The extent of the ob- 
served declines in crashes is impressive in light of the fact that crashes 
involve many causal factors other than alcohol. Deterrent effects have been 
found in virtually all well-designed studies of significant interventions, in 
many countries throughout the world. However, these effects univereally 
disappear over time, a matter of several months or a few years at most. 
One possible explanation for this fact is that the very low levels of actual 
likelihood of punishment are insufficient to continue an initial impression 
of reasonable certainty of punishment for the violator. 

No deterrent effect is evident for legal interventions that are directed 
solely at increasing the severity of punishment. Although there are only a 
few reported studies supporting this generalization, there are no negative find- 
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ings. It seems plausible to attribute this disconfirmation of deterrent expec- 
tations to the very low risk of punishment of any kind, which permits the 
violator to regard the threat as negligible, and to possible public perception 
of the fact that the criminal justice system does not necessarily deal any more 
severely after those interventions than before. Severity-based interventions are 
found to produce undesired and unanticipated side effects through the discretion 
of legal actors, behavior that may reflect the sense that actual offenses detected 
are a haphazard selection from a much larger population of undetected offenses, 
so that those charged are in a meaningful sense unlucky. 

RESEARCH AND POLICY IMPLICATIONS 

We must conclude that, as tests of the scientific validity of the deterrence 
proposition, existing research is inadequate. For many years social science 
research simply ignored the issue of deterrence. More recently the size and 
scope of the deterrence literature has increased dramatically, but it is still 
incapable of resolving basic theoretical questions, for several reasons. 

First, evaluations of the deterrent effects of punishment are often based 
on changes in formal laws rather than changes in actual enforcement be- 
haviors. This issue is particularly important with regard to sanction severity. 
Proclaimed increases in the severity of penalties repeatedly have been found 
to be vitiated by the reluctance or incapacity of legal agents to actually 
implement the new penalties. As for certainty, most interventions may be 
described as having increased the objective probability of punishment from 
"negligible" to "trivial" levels. The sheer resistance of the criminal justice 
system to piecemeal implementation of new penal sanctions may be the 
major finding ol studies ostensibly testing the deterrence proposition. 

Second, most prior research on deterrence relies on the unsupported as- 
sumption that changes in objective levels of certainty and severity of punish- 
ment are reflected in the perceptions that are the subject of the theoretical 
proposition being tested. Where the risk levels are extremely low, as is common 
in the situations being studied here, and where actual punishments are not 
increased despite policymakers' intentions, it is hazardous to assume that 
perceived certainty and severity of punishment have been changed. 

Third, and more serious for studies of street crime than of drunk driving, 
is the general inadequacy of research design. Much of the research on street 
crime is based on correlational and econometric analyses, ine defects of 
which have been well exposed (e.g., Greenberg, 1977; Blumstein et al., 
1978). In the few classical experiments, control groups are often contam- 
inated by proximity to experimental groups. Reliability and validity of 
measurement are serious problems for most of the studies of street crime 
interventions, and many of the time-series quasi experiments fail to control 
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for the possibility that events other than the deterrence-based legal inter- 
ventions may have caused declines in violations. For example, interventions 
against drunk driving in recent years have frequently coincided with eco- 
nomic transitions or crises in fuel availability. 

But despite the admitted weaknesses of evidence in individual studies, 
the accumulated literature supports a number of tentative conclusions. The 
deterrence proposition is generally supported in evaluations of legal inter- 
ventions bearing on the certainty of punishment for the offender. In the 
short run, at least, there is clear evidence that offense rates decline. In the 
long run, however, matters are not so clear, very likely because the deterrent 
effects depend on an overestimation of the chances of apprehension by the 
relevant public due to publicity and media attention surrounding the inter- 
ventions. This impression may be difficult to maintain in the face of daily 
experience that fails to support it. 

In contrast, there is very little evidence favoring deterrence in the matter 
of severity of punishment, even in the short run. One explanation is that 
the relevant public may readily learn or come to expect that the declared 
severity of penalties is compromised by resistance to change on the part of 
legal actors. An even more appealing explanation lies in a possible inter- 
action between perceived severity and certainty: where the likelihood of 
any punishment is very low, as it very often seems to be, the prospective 
offender discounts even severe penalties as negligible. 

Accepting this statement of the evidence, several questions can be raised 
for policy considerations. First, why does the deterrence approach, espe- 
cially in the matter of severe punishment, remain so popular as a basis for 
countermeasures against these and other social problems? Second, what are 
the prospects for obtaining long-term deterrent results through increasing 
the probabilities of punishment? Third, what alternatives might be proposed 
as a basis for more rational countermeasures? 

The Continued Reliance on Deterrence 

One reason for the continued tendency to invoke deterrence-related coun- 
termeasures may be the intuitive appeal of the detetence proposition. In- 
trospection informs us that we often refrain from prohibited acts because 
of threatened punishment. Moreover, our daily experience in the market- 
place confirms an economic counterpart of the deterrence proposition, which 
is that when the price of any good is raised, everything else remaining the 
same, less is consumed. If this intuitive confirmation fails us in nonmarket 
circv istances, it may be because much criminal and other socially proH- 
lematic behavior faces threats with unusually small probabilities of fulfill- 
ment, in which rational calculation and behavior are noted to be uncertain. 
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The utilities involved in deciding whether to drink and drive, or to steal 
from despised or helpless victims, may be more like those associated with 
gambling, where hordes of people enjoy participation in mathematically 
unfair games, than like ordinary market behavior. 

Second and, we think, very important, the cost of deterrence-based coun- 
termeasures tends to be delayed and obscure, while the costs of alternatives 
may be daunting. Increased severity, in particular, can seenangly be invoked 
with the stroke of a pen. In fact, as the grossly overcrowded conditions in 
many of the nation's prisons now attest, increasing the severity of punish- 
ment may be more costly in practice than it appears when considered as a 
basis for policy. 

Third, deterrence-based measures frequently relieve established institu- 
tions and vested interests that might have much to lose from other coun- 
termeasures. The appeal of deterrence-based approaches to drunk driving, 
for instance, rests heavily on the assumption that the problem lies with a 
small minority of irresponsible deviants. Neither the alcoholic beverage nor 
the automobile industry bears responsibility in this conception of the prob- 
lem. Likewise, massive public expenditures aimed at deterring street crime, 
though ineffective, focus attention on individual deviants and away from 
possible defects in the social structure, including gross inequalities in life 
chances among different perpetrator groups. 

Fourth, deterrence-based reasoning may provide a cover for retribution- 
based motives fueling popular movements. The person injured by a mugger 
or drunk driver understandably may feel that muggers and drunk drivers 
deserve punishment, "nit demands for action based on this feelLig may be 
more successful when legitimated by the promise of reductions in future 
damage from these causes. Perhaps in part for this reason, the anti-drunk- 
driving movement has been more insistent on enacting severe penalties than 
on requiring occupant-protection devices that would be much more effective 
in reducing fatalities. 



Deterrence-based policy seems to founder on the low actual rates of 
apprehension and punishment for offenders. It is possible that greatly in- 
creased investments in criminal justice might in the end be effective; nothing 
in the research literature refutes this possibility. But we doubt that this is 
a profitable line of endeavor. There is little in accumulated experience with 
street crimes or drunk driving to suggest where, in the scale of probability 
of punishment, a threshold of appreciably greater effeuxveness may lie, 
but it seems likely to involve levels of police intrusiveness and expense 
otherwise unknown in democratic societies. The American public is deeply 
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ambivalent about this approach and fundamentally negative when presented 
with unbiased estimates of the costs. For example, although the San Diego 
study showed that field interrogation by police might be an effective crime 
deterrent, many commentators have rejected this kind of police activity. 
Charles Reich (1979:113) has put it thus: "I fully recognize that safety is 
important and that safety requires measures. But other qualities also require 
measures: I mean independence, boldness, creativity, high spirits." Many 
Americans envy the tranquility of Japanese urban society, but most would 
find the Japanese system of policing, in which neighborhood police provide 
their headquarters with updated information on the daily lives of residents 
of their jurisdiction, too intrusive. 

The research literature also indicates that legal acton iesist substantive 
changes in their job-related behavior. This may be caused by feelings of 
injustice when the prescribed penalty fails to fit the crime, or it may simply 
be rationalized human laziness or procedural resistance to change. As Zim- 
ring (1978:171) states the case, "The resiliency of courts and police when 
policy changes are induced by outside money investments is formidable." 
Thus, in a recent study of the response of police to the mandate to "do 
somethirg about rape" in a large midwestern city (LaFree, 1981), it was 
clear that the police changed those things easiest to change — primarily 
recordkeeping— while doing little about the things that the deterrence prop- 
osition suggests would be most important for actually reducing rape, such 
as increasing arrests and filing more felony complaints. 

Alternatives to Deterrence 

The social science literature on both street crime and drank driving 
suggests then it may be more fruitful to consider offenders as rational 
reactors to their social environments rather than as irrational, maladapted, 
or pathological deviants. The case may be easier to make for the general 
population in such matters as white-collar crime (e.g., filing deceptive tax 
returns) than among street criminals, but the literature suggests that when 
the world is viewed from the deviant's position in society much of ihe 
problematical behavior appears normal and predictable (cf. Lempert, 1981- 
1982, for a discussion of this point among fathers ordered to pay child 
support). Surely the case is easily made for the drunk driver, who exists 
in a society that institutionalizes the use of alcohol as a social lubricant and 
mandates dependence on the private automobile for most of its members. 
Because the probability of the most severe consequence is in reality min- 
uscule (one fatality for about one third of a million miles of drunk driving), 
it seems easy for the individual driver departing a tavern late at night to 
answer the question, "How do I get home?" 
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For both street crime and drunk driving, this line of thought has led to 
suggestions for modifying the situation, rather than the individual, to reduce 
the problematic behavior. Street crimes may perhaps be better reduced by 
"hardening the target" than by deterring or, for that matter, reforming 
potential criminals. For example, Oscar Newman (1972) has written ex- 
tensively on how urban design can reduce crime by making living areas 
more "defensible.'* Similarly, drunk driving has been reduced in the 18- 
to 21 -year-old age group as a consequence of raising the drinking age in 
various states (Wagenaar, 1982). Effects are also being found for the re- 
striction of young drivers to daylight (nondrinking-hour) driving only (Preusser 
et al. , 1984). There is evidence for the view that raising the price of alcoholic 
beverages through taxation would reduce drunk driving along with a host 
of other alcohol-related problems (Moore and Gerstein, 1981). Much better 
public transportation, including subsidized taxi-like service, is another 
promising countermeasure. 

A somewhat different approach to these problems is based on accepting 
the difficulty of fundamental changes in social institutions as well as be- 
havior, and slaving to make the results of the problematic behavior less 
damaging to the victims. For example, drunk driving would be of relatively 
less consequence if it did not entail deaths and injuries. By "padding" the 
car with passive restraints and the highway with soft shoulders and yielding 
barriers around fixed obstacles, the inevitable crashes caused by alcohol 
(and a host of ether factors) could be better absorbed by society. Similarly, 
the cost of ouch activities as burglary could be lowered, though not elim- 
inated, by social insurance schemes to repay the vicrims' financial losses. 

Perhaps the main reason these types of countermeasures are less attractive 
than those based on deterrence is that their costs are out front, and they 
are not trivial. We further admit that they are unlikely to be "solutions" 
to the problems they address; most are merely mitigating. However, it 
strikes as as a more rational policy to experiment along these lines in hopes 
of finding economically sound mldgants than to follow the chimera of 
deterrence-based solutions that experience repeatedly shows to be inade- 
quate in the context for which they are proposed. 

* * * 

We would like to acknowledge the helpful comments of Gwynn Nettler 
and Jack Gibbs. 
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RISKY CHOICE 

The making of decisions is commonly complicated by the presence of 
uncertainty or risk. In general we cannot predict with certainty tomorrow's 
weather, the outcome of a medical treatment, or the future value of gold. 
Hence the decisions whether to undergo surgery, cany an umbrella, or buy 
gold must be made without advance knowledge of their consequences. It 
is therefore natural that the study of decisionmaking under risk has focused 
on choices between simple gambles with monetary outcomes and specified 
probabilities in the hope that these simple problems will reveal basic atti- 
tudes toward risk and value. 

We shall describe an approach to the analysis of risky choice that derives 
many of its hypotheses from a psychophysical analysis of value and prob- 
ability. Psychophysics is the study of the relations between physical mag- 
nitudes, such as length or money, and their psychological counterparts, 
such as perceived length or utility. 

The psychophysical approach to decisionmaking can be traced to a re- 
markable essay that Daniel Bernoulli published in 1738 (Bernoulli, 1738/ 
1954) in which he attempted to explain why people arc generally averse to 
risk and why risk aversion decreases with increasing wealth. To illustrate 
risk aversion and Bernoulli's analysis, consider the choice between a pros- 
pect that offers an 85 percent chance to win $1,000 (with a 15 percent 
chance to win nothing) and the alternative of receiving $800 for sure. A 
large majority of people prefer the sure thing over the gamble, although 
the gamble has higher (mathematical) expectation. The expectation of a 
monetary gamble is a weighted average, where each possible outcome is 
weighted by its probability of occurrence. The expectation of the gamble 
in this example is .85 x $1,000 + .15 x $0 = $850, which exceeds the 
expectation of $800 associated with the sure thing. The preference for the 
sure gain is an instance of risk aversion. In general, a preference for a sure 
outcome over a gamble that has higher or equal expectation is called risk 
averse, and the rejection of a sure thing in favor of a gamble of lower or 
equal expectation is called risk seeking. 

Bernoulli suggested that people do not evaluate prospects by the expec- 
tation of their monetary outcomes, but rather by the expectation of the 
subjective value of these outcomes. The subjective value of a gamble is 
again a weighted average, but now it is the subjective value of each outcome 
that is weighted by its piobability. To explain risk aversion within this 
framework, Bernoulli proposed that subjective value, or utility, is a concave 
function of money. In such a function, the difference between the utilities 
of $200 and $100, for example, is greater than the utility difference between 
$1,200 and $1,100. It follows from concavity that the subjective value 
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chance to win nothing) and the alternative of receiving $800 for sure. A 
large majority of people prefer the sure thing over the gamble, although 
the gamble has higher (mathematical) expectation. The expectation of a 
monetary gamble is a weighted average, where each possible outcome is 
weighted by its probability of occurrence. The expectation of the gamble 
in this example is .85 x $1,000 + .15 x $0 = $850, which exceeds the 
expectation of $800 associated with the sure thing. The preference for the 
sure gain is an instance of risk aversion. In general, a preference for a sure 
outcome over a gamble that has higher or equal expectation is called risk 
averse, and the rejection of a sure thing in favor of a gamble of lower or 
equal expectation is called risk seeking. 

Bernoulli suggested that people do not evaluate prospects by the expec- 
tation of their monetary outcomes, but rather by the expectation of the 
subjective value of these outcomes. The subjective value of a gamble is 
again a weighted average, but now it is the subjective value of each outcome 
that is weighted by its piobability. To explain risk aversion within this 
framework, Bernoulli proposed that subjective value, or utility, is a concave 
function of money. In such a function, the difference between the utilities 
of $200 and $100, for example, is greater than the utility difference between 
$1,200 and $1,100. It follows from concavity that the subjective value 
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attached to a gain of $800 is more than 80 percent of the value of a gain 
of $1 ,000. Consequently, the concavity of the utility function entails a risk 
averse preference for a sure gain of $800 over an 80 percent chance to win 
$1,000, although the two prospects have the same monetary expectation. 

It is customary in decision analysis to describe the outcomes of decision i 
in terms of total wealth. For example, an offer to bet $20 on the toss of a 
fair coin is represented as a choice between an individual's current wealth 
W and an even chance to move to W + $20 or to W - $20. This 
representation appears psychologically unrealistic: People do not normally 
think of relatively small outcomes in terms of states of wealth but rather 
in terms of gains, losses, and neutral outcomes (such as the maintenance 
of the status quo). If the effective carriers of subjective value are changes 
of wealth rather than ultimate states of wealth, as we propose, the psycho- 
physical analysis of outcomes should be applied to gains and losses rather 
than to total assets. This assumption plays a central role in a treatment or 
risky choice that we called prospect theory (Kahneman and Tversky , 1979). 
Introspection as well as psychophysical measurements suggest that subjec- 
tive value is a concave function of the size of a gain. The same generalization 
applies to losses as well. The difference in subjective value between a loss 
of $200 and a loss of $100 appears greater than the difference in subjective 
value between a loss of $1,200 and a loss of $1,100. When the value 
functions for gains and for losses are pieced together, we obtain an S-shaped 
function of the type displayed in Figure 1 . 

The value function shown in Figure 1 is (a) defined on gains and losses 
rather than on total wealth, (b) concave in the domain of gains and convex 
in the domain of losses, and (c) considerably steeper for losses than for 
gains. The last property, which we label loss aversion, expresses the in- 
tuition that a loss of $X is more aversive than a gain of $X is attractive. 



VALUE 



LOSSES 




GAINS 



FIGURE 1. A hypothetical value function. 
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Loss aversion explains people's reluctance to bet on a fair coin for equal 
stakes: The attractiveness of the possible gain is not nearly sufficient to 
compensate for the aversiveness of the possible loss. For example, most 
respondents in a sample of undergraduate; refused to stake $10 on the toss 
of a coin if they stood to win less than $30. 

The assumption of risk aversion has played a central role in economic 
theory. However, just as the concavity of the valu* of gains entails risk 
aversion, the convexity of the value of leases entails risk Peeking. Indeed, 
risk seeking in losses is a robust effect, oarticularly when the probabilities 
of loss are substantial. Consider, for example, a situation in which an 
individual is forced to choose between an 85 percent chance to lose $1,000 
(w th a 15 percent chance to lose nothing) and a sure loss of $300 A large 
majority *f people express a preference for the gamble over the sure loss. 
This is a risk-seeking choice becav > f he expectation of the gamble ( - $850) 
is inferior to the expectation of the sure loss (- $800). Risk seeking in the 
domain of losses has been confirmed by several investigators (Fishburn and 
Kochenberger, 1979; Hershey and Schoemaker, 1980; Payne et al., 1980; 
Slovic et al. , 1982). It has also been observed * . n nonmonetary outcomes, 
such as hours of pain (Eraker and Sox, 1981) and loss of human lives 
(Fischhoff, 1983; Tversky 1977; Tversky and Kahneman, 1981). Is it 
wrong to be risk averse in the domain of gains and risk seeking in the 
domain of losses? These pi Jerences conform to compelling intuitions about 
the subjective value of gains and losses, and the presumption is that people 
should be entitled to their own values. However, we shall see that an S- 
shaped value function has implications that are normative ly unacceptable. 

To address the normative issue we turn from psychology to decision 
theory. Modem decision theory can be said to begin with the pioneering 
work of von Neumann and Morgenstern (1947), who laid down several 
qualitative principles, or axioms, that should govern the preferences of a 
national decisionmaker. Their axioms included transivity (if A is preferred 
to B and B is preferred to C, then A is preferred to C), and substitution (if 
A is preferred to B, then an even cha; > to get A or C is preferred to an 
even chance to get B or C), along with other conditions of a more technical 
nature. The normative and the descriptive status of the axioms of rational 
choice have been the subject of extensive discussions. In particular, there 
*.s convincing evidence that people do not always obey the substitution 
adorn, and considerable disagreement exists about the normative merit of 
this axiom (e.g. , Allais and Hagen, 1979). However, all analyses of rational 
choice incorporate two principles: dominance and invariance. Dominance 
demands that if prospect A is at least as good as prospect B in t cry respect 
and better than B in at least one respect, then A should be preferred to B. 
Invariance requires that the preference order between prospects should not 



ERIC 



185 



CHOICES, VALUES. AND FRAMES 



157 



depend on the manner in which they are described. In particular, two 
versions of a choice problem that are recognized to be equivalent when 
shown together should elicit the same preference even when shown sepa- 
rately. We now show that the requirement of invariance, however elemen- 
tary and innocuous it may seem, cannot generally be satisfied. 

Framing of Outcomes 

Risky prospects are characterized by their possible outcomes and by the 
probabilities of these outcomes. The same option, however, can be framed 
or described in different ways (Tversky and Kahneman, 1^81). For example, 
the possible outcomes of a gamble can be framed either as gains and losses 
relative to the status quo or as asset positions that incorporate initial wealth. 
Invariance requires that such changes in the description cf outcomes should 
not alter the preference order. The following pair of problems illustrates a 
violation of this requirement. The total number of respondents in each 
problem is denoted by N y and the percentage who chose each option is 
indicated in parentheses. 

Problem 1 (N = 152): Imagine that the United States is preparing for 
the outbreak of an unusual Asian disease, which is expected to kill 600 
people. Two alternative programs to combat the disease have been proposed. 
Assume that the exact scientific estimates ot the consequences of the pro- 
grams are as follows: 

If Program A is adopted, 200 people will be saved. (72%) 

If Program B is adopted, there is a one-third probability that 600 people 
will be saved and a two-thirds probability that no people will be 
saved. (28%) 

Which of the two programs would you favor? 

The formulation of Problem 1 implicitly adopts as a reference point a 
state of affairs in which the disease is allowed to take its toll of 600 lives. 
The outcomes of the programs include the reference state and two possible 
gains, measuied by the number of lives saved. As expected, preferences 
are risk averse: A clear majority of respondents prefer saving 200 lives for 
sure over a gamble that offers a one-third chance of saving <)00 lives. Now 
consider another problem in which the same cover story h followed by a 
different description of the prospects associated with the two programs: 

Problem 2 (N = 155): 

If Program C is adopted, 400 people will die. (22%) 
If Program D is adopted, there is a one-third probability that nobody will 
die and a two-thirds probability that 600 people will die. (78%) 
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It is easy to verify that options C and D in Problem 2 are undistingu«shable 
in real terms from options A and B in Problem 1, respectively. The second 
version, however, assumes a reference state in which no one dies of the 
disease. The best outcome is the maintenance of this state and the alter- 
natives are losses measured by the number of people that will die of the 
disease. People who evaluate options in these terms are expected to show 
a risk seeking preference for the gamble (option D) over the sure loss of 
400 lives. Indeed, there is more risk seeking in the second version of the 
problem than the^ is risk aversion in the first. 

The failure of in variance is both pervasive and robust. It is as common 
among sophisticatec respondents as among naive ones, and it is not elim- 
inated even when the same respondents answer both questions within a few 
minutes. Respondents confronted with their conflicting answers are typically 
puzzled. Even after rereading the problems, they still wish to be risk averse 
in the "lives saved" version; they wish to be risk seeking in the "lives 
lost" version; and they also wish to obey invariance and give consistent 
answers in the two versions. In their stubborn appeal, framing effects 
resemble perceptual illusions more than computational errors. 

The following pair of problems elicits preferences that violate the dom- 
inance requirement of rational choice. 

Problem 3 (N = 86): Choose between: 

E. 25% chance to win $240 and 

75% chance to lose $760 (0%) 

F. 25% :hance to win $250 and 

75% chance to lose $750 (100%) 

It is easy to see that F dominates E. Indeed, all respondents chose accord- 
ingly. 

Problem 4 (N = 150): Imagine t*-at you face the following pair of 
concurrent decisions. First examine both decisions, then indicate the options 
you prefer. 

Decision (i) Choose between: 

A. a sure gain of $240 (84%) 

B. 25% chance to gain $1,000 and 

75% chance to gain nothing (16%) 
Decision (ii) Choose between: 

C. a sure loss of $750 (13%) 

D. 75% chance to lose $1,000 and 

25% chance to lose nothing (87%) 

As expected from the previous analysis, a large majority of subjects made 
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a risk-averse choice for the sure gain over the positive gamble in the first 
decision, and an even larger majority of subjects made a risk-seeking choice 
for the gamble over the sure loss in the second decision. In fact, 73 percent 
of the respondents chose A and D and only 3 percent chose B and C The 
same pattern of results was observed in a modified version of the problem, 
with reduced stakes, in which undergraduates selected gambles that they 
would actually play. 

Because the subjects considered the two decisions in Problem 4 simul- 
taneously, they expressed in effect a preference for A and D ovei B and 
C. The preferred conjunction, however, is actually dominated by the re- 
jected one. Adding the sure gain of $240 (option A) to option D yields a 
25 percent chance to win $240 and a 75 percent chance to lose $760. This 
is precisely option E in Problem 3. Similarly, adding the sure loss of $750 
(option C) to option B yields a 25 percent chance to win $250 and a 75 
percent chance to lose $750. This is precisely option F in Problem 3. Thus, 
the susceptibility to framing and the S-shaped value function produce a 
violation of dominance in a set of concurrent decisions. 

The moral of these results is disturbing: Invariance is normatively es- 
sential, intuitively compelling, and psychologically unfeasible. Indeed, we 
conceive only two ways of guaranteeing invariance. The first is to adopt a 
procedure that will transform equivalent versions of any problem into the 
same canonical representation. This is the rationale for the standard ad- 
monition to students of business, that they should consider each decision 
problem in terms of total assets rather than in terms of gains or losres 
(Schlaifer, 1959). Such a representation would avoid the violations of in- 
variance illustrated in the previous problems, but the advice is easier to 
give than to follow. Except in the context of possible ruin, it is more natural 
to consider financial outcomes as gains and losses rather than as states of 
wealth. Furthermore, a canonical representation of risky prospects requires 
a compounding of all outcomes of concurrent decisions (e.g., Problem 4) 
that exceeds the capabilities of intuitive computation even in simple prob- 
lems. Achieving a canonical represent? f ion L even more difficult in other 
contexts such as safety, health, or quality of life. Should we advise people 
to evaluate the consequence of a public health policy (e.g., Problems 1 and 
2) in terms of overall mortality, mortality due to diseases, or the number 
of deaths associated with the particular disease under study? 

Another approach that could guarantee invariance is the evaluation of 
options in terms of the j actuarial rather than their psychological conse- 
quences. The actuarial criterion has some appeal in the context of human 
lives, but it is clearly inadequate for finance choices, as has been generally 
recognized at :east since Bernoulli, and it is entirely inapplicable to out- 
comes that lack an objective metric. We conclude that frame invariance 
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making the decision weights highly unstable in that region. The over- 
weighting of low probabilities reverses the pattern described above: It en- 
hances the value of long shots and amplifies the aversiveness of a small 
chance of a severe loss. Consequently, people are often risk seeking in 
dealing with improbable gains and risk averse in dealing with unlikely 
losses. Thus, the characteristics of decision weights contribute to the at- 
tractiveness of both lottery tickets and insurance policies. 

The nonlinearity of decision weights inevitably leads to violations of 
invariance, as illustrated in the following pair of problems: 

Problem 5 (N = 85): Consider the following two-stage game. 

In the first stage, there is a 75% chance to end the game without winning 
anything and a 25% chance to move into the <^cond stage. If you reach 
the second stage you have a choice between: 

A. a sure win of $30 (74%) 

B. 80% chance to win $45 (26%) 
Your choice must be made before the game starts, i.e. , before the outcome 

of the first s;age is known. Please indicate the option you prefer. 

Problem 6 (N = 81): Which of the following op'.ions do you prefer? 

C. 25% chance to win $30 (42%) 

D. 20% chance to win $45 (58%) 

Because there is one chance in four to move into the second stage in 
Problem 5, prospect A offers a .25 probability of winning $30, and prospect 
B offers 25 x .80 = .20 probability of winning $45. Problems 5 and 6 



17 o 



CHOICES. VALUES. AND FRAMES 



161 




0 .5 1.0 

STATED PROBABILITY: p 

FIGURE 2. A hypothetical weighting function. 



making the decision weights highly unstable in that region. The over- 
weighting of low probabilities reverses the pattern described above: It en- 
hances the value of long shots and amplifies the aversiveness of a small 
chance of a severe loss. Consequently, people are often risk seeking in 
dealing with improbable gains and risk averse in dealing with unlikely 
losses. Thus, the characteristics of decision weights contribute to the at- 
tractiveness of both lottery tickets and insurance policies. 

The nonlinearity of decision weights inevitably leads to violations of 
invariance, as illustrated in the following pair of problems: 

Problem 5 (N = 85): Consider the following two-stage game. 

In the first stage, there is a 75% chance to end the game without winning 
anything and a 25% chance to move into the <^cond stage. If you reach 
the second stage you have a choice between: 

A. a sure win of $30 (74%) 

B. 80% chance to win $45 (26%) 
Your choice must be made before the game starts, i.e. , before the outcome 

of the first s;age is known. Please indicate the option you prefer. 

Problem 6 (N = 81): Which of the following op'.ions do you prefer? 

C. 25% chance to win $30 (42%) 

D. 20% chance to win $45 (58%) 

Because there is one chance in four to move into the second stage in 
Problem 5, prospect A offers a .25 probability of winning $30, and prospect 
B offers 25 x .80 = .20 probability of winning $45. Problems 5 and 6 



17 o 



162 



DANIEL KAHNEMAN and AMOS TVERSKY 



are therefore identical in terms of probabilities and outcomes. However, 
the preferences aie not the ^ ne in the two versions: A clear majority favors 
the higher chance to win the smallei amount in Problem 5, whereas the 
majority goes the other way in Problem 6. This violation of invariance has 
been confirmed with both real and hypothetical monetary payoffs (the pres- 
ent results are with real money), with human lives as outcomes, and with 
a nonsequential representation of the chance process. 

We attribute the failure of invariance to the interaction of two factors: 
the framing of probabilities and the nonlinearity of decision weights. More 
specifically, we propose that in Problem 5 people ignore the first phase, 
which yields the same outcome regardless of the decision that is made, and 
focus their attention on what happens if they do reach the second stage of 
the game. In that case, of course, they face a sure gain if they choose option 
A and an 80 percent chance of winning if they prefer to gamble. Indeed, 
people's choices in the sequential version are practically identical to the 
choices they make between a sure gain of $30 and an 85 percent chance 
to win $45. Because a sure thing is overweighted in comparison with events 
of moderate or high probability (see Figure 2) the option that nuiy lead to 
a gain of $30 is more attractive in the sequential version. We call this 
phenomenon the pseudo-certainty effect because an event that is actually 
uncertair is weighted as if it were certain. 

A closely related phenomenon can be demonstrated at the low end of 
the probability range. Suppose you are undecided whether or not to purchase 
earthquake insurance because the premium is quite high. As you hesitate, 
your friendly insurance agent comes forth with an alternative offer "For 
half the regular premium you can be fully covered if the quake occurs on 
an odd day of the month. This is a good deal because for half the price 
you are covered for more than half the days." Why do most people find 
such probabilistic insurance distinctly unattractive? Figure 2 suggests an 
answer. Starting anywhere in the region of lorv probabilities, the impact 
on the decision weight of a reduction of probability from p to pll is con- 
siderably smaller than the effect of a reduction from pll to 0. Reducing 
the risk by half, then, is not worth half the premium. 

The aversion to probabilistic insurance is significant for three reasons. 
First, it undermines the classical explanation of insurance in terms of a 
concave utility function. According to expected utility theory, probabilistic 
insurance should be definitely preferred to normal insurance when the latter 
is just acceptable (see Kahnerian and Tversky, 1979). Second, probabilistic 
insurance represents many forms of protective action, such as having a 
medical checkup, buying new tires, or installing a burglar alarm system. 
Such actions typically reduce the probability of some hazard without elim- 
inating it altogether. Third, the acceptability of insurance can be manipu- 
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latcd by the framing of the contingencies. An insurance policy that covers 
fire but not flood, for example, could be evaluated either as full protection 
against a specific risk, (e.g. , fire) or as a reduction in the overall probability 
of property loss. Figure 2 suggests that people greatly undervalue a reduction 
in the probability of a hazard in comparison to the complete e 1 " ^ination of 
that hazard. Hence, insurance should appear more attractive when it is 
framed as the elimination of risk than when it is described as a reduction 
of risk. Indeed, Slovic, Rschhoff, and Lichtenstein (1982) showed that a 
hypothetical vaccine that reduces the probability of contracting a disease 
from 20 pence * to 10 percent is less attractive if it is described as effective 
in half of fie cases than if it is presented as fully effective against one of 
two exclusi /e and equally probable virus strains that produce identical 
symptoms. 

Formulation Effects 

So far we have discussed framing as a tool to demonstrate failures of 
invariance. We now turn attention to the processes that control the framing 
of outcomes and events. The public health problem illustrates a formulation 
effect in which a change of wording from "lives saved" to "lives lost" 
induced a marked shift in preference from risk aversion to risk seeking. 
Evidently, the subjects adopted the descriptions of the outcomes as given 
in the question and evaluated the outcomes accordingly as gains or losses. 
Another formulation effect was reported by McNeil, Pauker, Sox, and 
Tversky (1982). They found that preferences of physicians and parents 
between hypothetical therapies for lung cancer varied markedly when their 
probable outcomes were described in terms of mortality or survival. Sur- 
gery, unlike radiation therapy, entails a risk of death during treatment. As 
a consequence, the surgery option was relatively less attractive when the 
statistics of treatment outcomes were described in terms of mortality rather 
than in terms of survival. 

A physician, and perhaps a presidential advisor as veil, could influence 
the decision made by the patient or by the President, without distorting or 
suppressing information, merely by the framing of outcomes and contin- 
gencies. Formulation effects can occur fortuitously, without anyone being 
aware of the Hpact of the frame on the ultimate decision. They can also 
be exploited jliberately to manipulate the relative attractiveness of options. 
For example, Thaler (1980) noted that lobbyists for the credit card industry 
insisted that any price difference between cash and credit purchases be 
labeled a cash discount rather than a credit card surcharge. The two labels 
frame the price difference as a gain or as a loss by implicitly designating 
either the lower or the higher price as normal. Because losses loom larger 
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than gains, consumers are less likely to accep. a surcharge than to forego 
a discount. As is to be expected, attempts to infl jence framing are common 
in the marketplace and in the political arena. 

The evaluation of outcomes is susceptible to formulation effects because 
of the nonKnearity of the value function and the tendency of people to 
evaluate options in relation to the reference point that is suggested or implied 
by the statement of the problem. It is worthy of note that in other contexts 
people automatically transform equivalent messages into the same repre- 
sentation. Studies of language comprehension indicate that people quickly 
recode much of what they hear into an abstract representation that no longer 
distinguishes whether the idea was expressed in an active or in a passive 
form and no longer discriminates what was actually said from what was 
implied, presupposed, or implicated (Clark and Clark, 1977). Unfortu- 
nately, the mental machinery that performs these operations silently and 
effortlessly is not adequate to perform the task of recoding the two versions 
of the public health problem or the mortality-survival statistics into a com- 
mon abstract form. 



Our analysis of framing and of value can be extended to choices between 
multiattribute options, such as the acceptability of a transaction or a trade. 
We propose that, in order to evaluate a multiattribute option, a person sets 
up a mental account that specifies the advantages and the disadvantages 
associated with the option, relative to a multiattribute reference state. The 
overall value of an option is given by the balance of its advantages and its 
disadvantages in relation to the reference state. Thus, an option is acceptable 
if the value of its advantages exceeds the value of its disadvantages. This 
analysis assumes psychological — but not physical — separability of advan- 
tages and disadvantages. The model does not constrain the manner in which 
separate attributes are combined to form overall measures of advantage and 
of disadvantage, but it imposes on these meaf is assumptions of concavity 
and of loss aversion. 

Our analysis of mental accounting owes a large debt to the stimulating 
work of Richard Thaler (1980, in press), who shewed the relevance of this 
process to consumer behavior. The following problem, based on examples 
of Savage (1954) and Thaler (1980), introduces some of the rules that 
govern the construction of mental accounts and illustrates the extension of 
the concavity of value to the acceptability of transactions. 

Problem 7: Imagine that you are about to purchase a jacket for $125 and 
a calculator for S 1 5. The calculator salesman informs you that the calculator 
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prices is surely controlled by shoppers' efforts to find the best buy, ihese 
results suggest that consumers hardly exert more effort tc save $15 on a 
$150 purchase than to save $5 on a $50 purchase. 

The topical organization of mental accounts leads people ic evaluate 
gains and losses io relative rather than in absolute terms, resulting in large 
variations in die rate at which money is exchanged for other things, such 
as the number of phone calls made to find a good buy or the willingness 
to drive a long distance to get one. Most consumers will find it easier to 
buy a car stereo system or a Persian rug, respectively, in the context of 
buying a car or a house than separately. These observations, of course, run 
counter to the standard rational theory of consumer behavior, which assumes 
invariance and does not recognize the effects of mental accounting. 

The following problems illustrate another example of mental accounting 
in which the posting of a cost to an account is controlled by topical or- 
ganization: 

Problem 8 (N = 200): Imagine that you have decided to see a play and 
paid the admission price of $10 per ticket. As you enter the theater, you 
discover that you have lost the ticket. The seat was not marked, and the 
ticket cannot be recovered. 

Would you pay $10 for another ticket? 
Yes (46%) No (54%) 

Problem 9(N = 183): Imagine that you have decided to see a play where 
admission is $10 per ticket. As you enter the theater, you discover that you 
have lost a $10 bill. 

Would you still pay $10 for a ticket for the play? 
Yes (88%) No (12%) 

The difference between the responses to the two problems is intriguing. 
Why are so many people unwilling to spend $10 after having lost a ticket, 
if they would readily spend that sum after losing an equivalent amount of 
cash? We attribute the difference to the topical organization of mental 
accounts. Going to the theater is normally viewed as a transaction in which 
the cost of the ticket is exchanged for the experience of seeing the play. 
Buying a second ticket increases the cost of seeing the play to a level that 
many respondents apparently find unacceptable. In contrast, the loss of the 
cash is not posted to the account of the play, and it affects the purchase of 
a ticket only by making the individual feel slightly less affluent. 

An interesting effect was observed when the two versions of the problem 
were presented to the same subjects. The willingness to replace a lost ticket 
increased significantly when that problem followed the lost-cash version. 
In contrast, the willingness to buy a ticket after losing cash was not affected 
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by prior presentation of the other problem. The juxtaposition of the two 
problems apparently enabled the subjects to realize that it makes sense to 
think of the lost ticket as lost cash, but not vice versa. 

The normative status of the effects of mental accounting is questionable. 
Unlike earlier examples, such as the public health problem, in which the 
two versions differed only in form, it can be argued that the alternative 
versions of the calculator and ticket problems differ also in substance. In 
particular, it may be more pleasurable to save $5 on a $15 purchase than 
on a larger purchase, and it may be more annoying to pay twice for the 
same ticket than to lose $10 in cash. Regret, frustration, and self-satisfaction 
can also be affected by framing (Kahneman and Tversky, 1982). If such 
secondary consequences are considered legitimate, then the observed pref- 
erences do not violate the criterion of invariance and cannot readily be ruled 
out as inconsistent or erroneous. On the other hand, secondary consequences 
may change upon reflection. The satisfaction of saving $5 on a $15 item 
can be marred if the consumer discovers that she would not have exerted 
the same effort to save $10 on a $200 purchase. We do not wish to rec- 
ommend that any two decision problems that have the same primary con- 
sequences should be resolved in the same way. We propose, however, that 
systematic examination of alternative framings offers a useful reflective 
device that can help decisionmakers assess the values that should be attached 
to the primary and secondary consequences of their choices. 

Losses and Costs 

Many decision problems take the form of a choice between retaining the 
status quo and accepting an alternative to it, which is advantageous in some 
respects and disadvantageous in others. The analysis of value that was 
applied earlier to unidimensional risky prospects can be extended to this 
case by assuming that the status quo defines the reference level for all 
attributes. The advantages of alternative options will then be evaluated as 
gains and their disadvantages as losses. Because losses loom larger than 
gains, the decisionmaker will be biased in favor of retaining the status quo. 

Thaler (1980) coined the term "endowment effect" to describe the re- 
luctance of people to part from assets that belong to their endowment. When 
it is more painful to give up an asset than it is pleasurable to obtain it, 
buying prices will be significantly lower than selling prices. That is, the 
highest price that an individual will pay to acquire an asset will be smaller 
than the minimal compensation that would induce the same individual to 
give up that asset, once acquired. Thaler discussed some examples of the 
endowment effect in the behavior of consumers and entrepreneurs. Several 
studies have reported substantial discrepancies between buying and selling 
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prices in both hypothetical and real transactions (Gregory, 1983; Hammack 
and Brown, 1974; Knetsch and Sinden, in press). These results have been 
presented as challenges to standard economic theory, in which buying and 
selling prices coincide except for transaction costs and effects of wealth. 
We also observed reluctance to trade in a study of choices between hy- 
pothetical jobs that differed in weekly salary (Si and in the temperature (T) 
of the workplace. Our respondents were asked to imagine that they held a 
particular position (S b l x ) and were offered the option of moving to a 
different position (S 2 , T 2 ), which was better in one respect and worse in 
another. We found that most subjects who were assigned to (Si, Ti) did 
not wish to move to (S 2 , T 2 ), and that most subjects who were assigned to 
the latter position did not wish to move to the former. Evidently, the same 
difference in pay or in working conditions looms larger as a disadvantage 
than as an advantage 

In general, loss aversion favors stability over change. Imagine two he- 
donically identical twins who find two alternative environments equally 
attractive. Imagine further that by force of circumstance the twins are 
separated and placed in the two environments. As soon as they adopt their 
new states as reference points and evaluate the advantages and disadvantages 
of each other's environments accordingly, the twins will no longer be 
indifferent between the two states, and both will prefer to stay where they 
happen to be. Thus, the instability of preferences produces a preference for 
stability. In addition to favoring stability over change, the combination of 
adaptation and loss aversion provides limited protection against regret and 
envy by reducing the attractiveness of foregone alternatives and of others' 
endowments. 

Loss aversion and the consequent endowment effect are unlikely to play 
a significant role in routine economic exchanges. The owner of a store, for 
example, does not experience money paid to suppliers as losses and money 
received from customers as gains. Instead, the merchant adds costs and 
revenues over some period of time and evaluates only the balance. Matching 
debits a: 1 credits are effectively cancelled prior to evaluation. Payments 
made by consumers are also not evaluated as losses but as alternative 
purchases. In accord with standard economic analysis, money is naturally 
viewed as a proxy for the goods and services that it could buy. This mode 
of evaluation is uade explicit when an individual has in mind a particular 
alternative, such as #4 I can either buy a new camera or a new tent." In this 
analysis, a person will buy a camera if its subjective value exceeds the 
value of retaining the money it would cost. 

There are cases in which a disadvantage can be framed either as a cost 
or as a loss. In particular, :he purchase of insurance can also be framed as 
a choice between a sure loss and the risk of a greater loss. In such cases 
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the cost-loss discrepancy can lead to failures of invariance. Consider, for 
example, the choice between a sure loss of $50 and a 25 percent chance 
to lose $200. Slovic et al. (1982) reported that 80 percent of their subjects 
expressed a risk-seeking preference for the gamble over the sure loss. 
However, only 35 percent of subjects refused to pay $50 for insurance 
against a 25 percent risk of losing $200. Similar results were also reported 
by Schoemaker and Kunreuther (1979) and by Hershey and Schoemaker 
(1980). We suggest that the same amount of money that was framed as an 
uncompensated loss in the first problem was framed as the cost of protection 
in the second. The modal preference was reversed in the two problems 
because losses are more aversive than costs. 

We have observed a similar effect in the positive domain, as illustrated 
by the following pair of problems: 

Problem 10: Would you accept a gamble that offers a 10% chance to 
win $95 and a 90% chance to lose $5? 

Problem 1 1: Would you pay $5 to participate in a lottery that offere a 
10% chance to win $100 and a 90% chance to win nothing? 

A total of 132 undergraduates answered the two questions, which were 
separated by a short filler problem. The order of the questions was revereed 
for half the respondents. Although : is easily confirmed that the two prob- 
lems offer objectively identical options, 55 of the respondents expressed 
different preferences in the two versions. Among them, 42 rejected the 
gamble in Problem 10 but accepted the equivalent lottery in Problem 11. 
The effectiveness of this seemingly inconsequential manipulation illustrates 
both the cost-loss discrepancy and the power of framing. Thinking o f the 
$5 as a payment makes the venture more acceptable than thinking of the 
same amount as a loss. 

The preceding analysis implies that an individual's subjective state can 
be improved by framing negative outcomes as costs rather than as losses. 
The possibility of such psychological manipulations may explain a para- 
doxical form of behavior that could be labeled the dead-loss effect. Thaler 
(1980) discussed the example of a man who develops tennis elbow soon 
after paying the membership fee in a tennis club and continues to play in 
agony to avoid wasting his investment. Assuming that the individual would 
not play if he had not paid the membership fee, the question arises: How 
can playing in agony improve the individual's lot? Playing in pain, we 
suggest, maintains the evaluation of the membership f ee i. a cost. If the 
individual were to stop playing, he would be forced to recognize the fee 
as a dead loss, which may be more aversive than playing in pain. 
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CONCLUDING REMARI 

The concepts of utility and value are comnu *y used in two distinct 
senses: (a) experience value, the degree of pleasure or pain, satisfaction or 
anguish in the actual experip«ice of an outcome; and (b) decision value, the 
contribution of an anticipated outcome to the overall attractiveness or aver- 
siveness of an option in a choice. The distinction is rarely explicit in decision 
theory because it is tacitly assumed that decision values and experience 
values coincide. This assumption is part of the conception of an idealized 
decisionmaker who is able to predict fv* ire experiences with perfect ac- 
curacy and evaluate options accordingly. Fur ordinary decisionmakers, how- 
ever, the correspondence of decision values between experience values is 
far from perfect (March, 1978). Some factors that affect experience are not 
easily anticipated, and some factors that affect decisions do not have 
comparable impact on the experience of outcomes. 

In contrast to the large amount of research on decisionmaking, there has 
been relatively little systematic exploration of the psychophysics that relate 
hedoJ" experience to objective states. The most basic problem of hedonic 
psychophysics is the determination of the level of adaptation or aspiration 
that separates positive from negative outcomes. The hedonic reference point 
is largely determined by he objective status quo, but it is also affected by 
expectations and social comparisons. An objective improvement can be 
experienced as u loss, for example, when an employee receives a smaller 
raise than everyone else in the office. The experience of pleasure or pain 
associated with a change of state is also critically dependent on the dynamics 
<f hedonic adaptation. Brickman and Camr^eirs (1971) concept of the 
hedonic treadmill suggests the radical hypothesis that rapid adaptation will 
cause the effects of any objective improvement to be short-lived. The 
complexity and subtlety of hedonic experience make it difficult for the 
decisionmaker to anticipate the actual experience that outcomes will pro- 
duce. Many a person who ordp. * a meal when ravenously hungry has 
admitted to a big mistake when th \ fifth course arrived on the table. The 
common mismatch of decision values and experience values introduces an 
additional element of uncertainty in maii> decision problems. 

The prevalence of framing effects and violations of in variance further 
complicates the relation between decision values and experience values. 
The framing of outcomes often induces decision values that have no coun- 
terpart in actual experience. For example, the framing of outcomes of 
therapies tor lung cancer in terms of mortality or survival is unlikely to 
affect experience, although it can have a pronounced influence on choice. 
In other cases, however, the framing of decisions affects not only decision 
but experience as well. For example, the framing of an expenditure as an 
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uncompensated loss or as the price of insurance can probably influence the 
experience of that outcome. In such cases, the evaluation of outcomes in 
the context of decisions not only anticipates experience but also molds it. 
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designed methods one could find ways to pose rather complex questions 
to infants and young children. A substantial new body of data has now 
accumulated about the capacities of infants and young children and stands 
in contrast to the older emphasis on what they lack. From these data a 
contemporary view has emerged that the very young can be competent, 
active agents of their own conceptual development. In short, the mind of 
the young child has come to life. 

This essay is divided into four parts. First, we introduce the seminal 
theoretical ideas that have influenced psychologists' conceptions of the 
child's emergent mind. Next, we delineate some of the evidence in support 
of infant and preschool cognitive competence and illustrate some of the 
methods developed to make the study of young minds plausible. Finally, 
we ask how this putative youthful brilliance interacts with formal learning 
tasks in school, emerging with a seeming paradox: Young children seem 
to know more than we thought possible, but older children in schools seem 
to be much less competent ttu. was once assumed. The natural learning 
settings of young children are contrasted with the formal environments they 
encounter at school, and we see that r stnctional programs that capitalize 
on young children's natural propensities to create and test theories can 
significantly accelerate learning. 



The first step away from the empiricists' "tabula rasa" view of the infant 
mind was taken by the Swiss psychologist Jean Piaget. Beginning in the 
1920s, Piaget argued for the need to postulate complex cognitive structures 
in the young human mind, which empiricist accounts of human thought 
had tended to play down cr deny. Piaget did not think that human infants 
are born with innate cognitive structures, but rather that structures develop 
due to the child's ever-present tendency to engage the environment actively, 
interpreting it in accordance with progressively changing cognitive "schemes." 
From close observations of infants and vareful questioning of children, he 
concluded that cognitive development proceeds through certain stages, each 
involving radically different cognitive schemes, so that sometimes young 
children even form practical convictions con' r ary to those held by older 
children and adults. 

While Piaget observed that infants actually seek environmental stimu- 
lation that promotes their intellectual development, he thought that their 
initial representations of objects, space, time, cause, and self are constructed 
only gradually during the first two years. He concluded tf?.t the world of 
young infants is an egocentric fusion of the internal and external worlds, 
and that the development of an accurate representation of physical reality 
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depends on the gradual coordination of schemes cf looking, listening, and 
touching. Piaget thought that for many months the infant does not realize 
that an object producing a given sound is the same as an object that looks 
a certain way. The very young infant, up to 10 months or so, was said to 
think that an object exists only as long as she can touch, hear, or see it; 
once out of direct sensory contact, it ceased to exist. From this view, it 
followed that babies do not represent an independent space in which three- 
dimensional objects exist. In this regard, Piaget's account of infant cognition 
is actually close to being empiricist; still the position that cognitive schemes 
are actively constructed rather than passively impressed separates him from 
empiricists (or in modern terminology, behaviorists). 

Noam Chomsky (1957), focusing on language, proposed that the human 
mind is innately prepared to leam language without needing much help 
from the environment. He provided explicit hypotheses about the nature of 
the language structures that produce and comprehend language, an account 
that held out the promise of explaining how young children can say things 
they have never heard, e.g., *Tm unthirsty,** "I have two footses," "I 
werted home." Chomsky's hypotheses are still controversial (Wanner and 
GIiMttnan 1982), but the effect of his work gave strong impetus to a 
"narivist" account of mental abilities, which maintains that humans are 
born with conceptual structures that guide the acquisition cf knowledge 
about the world. 

Like Piaget, the Gibsons have maintained that infants actively explore 
the environment, but in sharp contrast, they deny that the infant slowly 
constructs the wortf. They maintain that, shortly after birth, the infant's 
world is a remarkably veridical one, filled with three-dimensional objects 
in real space, not unconnected elementary sensations. They support their 
view with findings that neonates integrate sight and sound and respond as 
if they assume that the world is out there waiting to be explored. The 
Gibsons assign a role to learning but propose that it proceeds rapidly due 
to the initial availability of exploration patterns that can yield accurate 
information about objects and events. 

Simon (1972) and his colleagues (e.g., Klahr and Wallace, 1976) helped 
introduce a somevvhat different perspective, that development m.ars over- 
coming information-processing constraints, such as limited short-term mem- 
ory capacity and lack of general knowledge. Those working in the information- 
processing tradition focused both on the possibility that early failures m 
completing Piagetian tasks are due in part to limits on processing capacity 
and the conditions under which children actively employ strategies for 
problem solving and knowledge acquisition. 

All these theoretical developments challenged the empiricist account and 
influenced the direction of research in developmental psychology. The claim 
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thai young children have different mental structures and ideas about the 
world was taken up in investigations of their concepts, strategies, and 
problem-solving abilities. These studies led to the conclusion that, despite 
the many differences between young and old, the young have remarkable 
abilities to participate actively in their acquisition of knowledge. 

STUDYING INFANT KNOWLEDGE 

Because infants are so limited physically, experimc .ts to find out what 
they know and how they think have had to find methods suitable to the 
level of infant motor capabilities. A good example is a method used by 
Kalnins and Bruner (19^3). Tbey showed 5- to 12- week-old infants a silent 
color film and gave the infants a pacifier to suck, tar nipple of which was 
connected to a pressure switch controlling the projector lens. The infants 
quickly learned to suck at a given rate to bring the movie into focu£, showing 
not only that they were capable of and interested in learning how to control 
their own sensory environment but also that they preferred a ciear image 
to a blurry one. 

A second method demonstrates — and depends on — an infant's thirst for 
novelty. The 4 'habituation paradigm" involves presenting babies with a 
stimulus — a picture, sound, or series of sounds — to whicl <he baby attends 
cither by looking at it, turning to it, or doing something to keep the stimulus 
on. Over a series of trials, infants, like everyone else, stop responding to 
repeated plantations of the same stimulus; that is, they habituate. They 
recover ink*— if a recognizably different stimulus is presented. For ex- 
ample, four-month-old hfants will suck vigorously when first introduced 
to the phoneme (speech sound) "ba," then gradually lose interest and stop 
sucking in response to it. But when presented a different phoneme, "pa/' 
they resume sucking (Eimas et al., 1971). 

Fantz (1961, 1966) directed attention to the power of the preference 
method to study infants* tendency to explore. He determined what infants 
looked at by watching their eyes closely. Infants lying on their backs in 
his laboratory could look up to the left or right at, for example, a bull's 
eye and a checkerboard. The experimenter recorded whether and for how 
long the baby looked left or right. Even newborns chose to look at patterned 
displays over homogenous gray ones. Infants generally prefer somewhat 
novel displays over ones they have seen before (e.g., Kagan et al., 1978; 
Kessenetal., 1972). 

Studies like these do more than simply show that infants actively select 
experiences; they can also tell us what the infant is capable of perceiving 
and knowing. Recovery of interest in a novel speech sound could not occut 
if infants co;id not recognize the rather subtle difference between "pa" 
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and <4 ba." (See Aslin et al., 1983.) The same holds for visual preferences, 
discovering that very young infants can see, hear, smell, and be particular 
about what exactly ihey perceive led to an emboldened attitude about the 
kinds of experimental questions that could be asked. The answers about 
infant understanding of the physical and numerical properties of objects 
have been quite remarkable. 

Early Knowledge of Objects 

Raget concluded that before infants could know about objects, they would 
have to discover regularities between their sensations and actions, then 
gradually integrate the sense-action schemes formed when they touched, 
heard, and looked at objects, and finally come to appreciate the object ai 
a separate reality in the external world. Like the empiricists, Piaget thought 
infants responded to the immediate stimuli, i.e., flashes of light on the 
retina or sound waves in the eardrum, long before they recognized sources 
of stimuli. 

Recent experiments (Gibson and Spelke, 1983; Karris, 1983) have told 
a different story. For example, Spelke (1976) used visual-preference meth- 
ods to determine that four-month-old infants already integrate the sight and 

sound of an event. Infants were shown two films projected side b v side 

a person playing peek-a-boo and a hand beating a tambourine. The sound 
accompaniment of one film was fed to a hidden loudspeaker placed midway 
between the films. The babies reliably preferred to look at the movie cor- 
responding to the sound source. OtHer research indicates that babies are 
bom with a tendency to turn tc z sound and visually search for something 
there (Field et al., 1980; Mendelson and Haith, 1976; Wertheimer, 1961). 

This integrative capacity extends beyond auditory and visual properties 
to include the sense of touch. In recent experiments, Gibson and Walker 
( 1984) gave one-month-old infants either a hard lucite cylinder or a lookalike 
soft sponge cylinder to explore with their mouths. The experimenter thai 
showed each infant both cylinders, squeezing the spongy cylinder in one 
hand and rotating the hard cylinder with the o. terhand. The infants preferred 
to look at whichever cylinder had not previously been explored orally, 
showing a capacity to integrate what they saw with what they had mouthed. 
Meltzoff and Borton (1979) reported similar findings with objects that were 
smooth or had tiny no $ on their surface. 

These findings establish two important points about the cognitive struc- 
ture that infants employ to interpret sensory input from objects: (a) They 
endow objects with properties, such as rigidity, that transcend sensory 
modality; and (b) infants can appreciate such properties even when they 
are not acting on the objects. 
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Von Hofsten (1980) provides further evidence that babies know things 
about objects before they can successfully act on them. Around four months 
of age, infants are able to reach out and grasp objects. At the same time, 
without having had the experience of successfully catching a moving object, 
they also anticipate trajectories correctly and move their hand toward the 
spot where a moving object will be. This would appear to require reckoning 
of the velocity and direction of the object, foreknowledge of the time that 
the arm movement will take, and ability to combine these in calculating an 
intercept. 

Piaget noted the considerable difficulty infants have with occluded ob- 
jects. When infants four to eight months old are shown an interesting object, 
they reach for and grasp it and <wen follow its fall to the floor. But they 
stop reaching or looking if the object disappears behind a barrier. Infants 
8 to 12 months old will seek and retrieve an object they see someone cover, 
but they show an odd tendency referred to as the 44 A not B error/' If the 
baby sees an object taken from behind one barrier, A, and, while (he baby 
watches, moved behind another barrier, B, the baby searches only behind 
the first barrier (A)! Piaget concluded that "the object is still not the same 
to the child as it is to us: a substantial body, individualized and displaced 
in space without depending on the action context in which it was inserted'" 
(Piaget, 1954:64). Recent research suggests that the A not B error may be 
confined to particular experimental situations (Bjork and Cummings, 1984; 
Sophian, 1984). After all, if babies can match what they mouth with what 
they see, distinguishing between solid and spongy substances, they must 
be sensitive to objects as substantial bodies. 

As further evidence for this vie ? ', Baillargeon et al. (in press) showed 
five-month-old infants a screen that rotated toward and away from them 
through an arc of 180 degrees. Once the babies were habituated to the 
rotating screen, a yellow cube was placed alongside the screen for two trials 
of viewing. Then the cube was placed behind the screen; on alternating 
trials, the infant saw either a screen that once again rotated through the full 
180-degree arc (and at least from the adult perspective seemed to crush the 
covered object) or a screen that rotated through only a 120-degree arc, 
stopping at the angle at which its further rotation would be blocked by the 
presence of a solid object behind the screen. (One-way mirrors and varied 
lighting accomplished the visual effects.) Although 4e infants had previ- 
ously habituated to the full rather than partial rotation, they nonetheless 
looked longer at the full rotation, treating the habituated event as even more 
novel than one they had never se^n before. These results suggest the babies 
expect solid objects to persist even when no longer in sight. 

In short, considerable research with young infants has shown that they 
treat objects and events as sources for multiple kinds of sensory input, and 
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that they recognize in objects properties such as rigidity and solidity that 
transcend specific sensory modalitier. 

Abstract Concepts 

Many theories of cognition have assumed that lanf ~ge is necessary to 
abstract properties common to a set of objects. Premack (1976) tellingly 
refuted this thesis when he rhowed that once a chimpanzee had learned the 
symbol for apple, it could apply that symbol to various parts of an apple 
(seed, peel, etc.). Preverbal human infants also recognize properties com- 
mon to sets of nonidentical objects. Ross (1980), for example, habituated 
one- and two-year olds to one of five classes of items: O shapes, M shapes, 
furniture, men, and food. Then children were shown another item from the 
same class or an item from a novel class. They preferred the item from the 
novel class. The children's ability to recognize category membership was 
uncorrelated with their ability to supply a verbal label for the category. 

Number is a property of sets divorced from any description of the objects 
themselves. Hence, it is often treated as the ultimate in abstraction. A 
variety of results indicate that infants abstract number from visual displays 
of two, three, and sometimes four items (Starkey and Cooper, 1980; Starkey 
et al., in press; Strauss and Curtis, 19S i). For example, six- to n:ne-month- 
old infants became habituated to color photographs of either two or three 
assorted common household items, e.g., sponge, cloth, vase, comb, apple, 
etc. (each trial displayed different items). Infants who were habituated to 
two-object displays then looked longer at three-item ones, and vice versa. 
Infants even abstract number intermodally (Starkey et al., 1983). They 
prefer to look at the one of two displays that matches the number of 
drumbeats (two or three) they hear emanating from a centrally placed loud- 
speaker. 

Summary 

We have sampled the evidence that infants are not passive, Mnstructureo 
receivers of environmental input. Scon after birth they reveal an impress^ e 
degree of implicit conce t *ual structure allied to active learning endeavors. 
They behave as if they recognize that objects are independent of themselves, 
having size and solidity, and are specified intermodally. They reveal sen- 
sitivity to some properties of moving objects and form ronce^ts about some 
abstract properties < * sets. It is rot at all obvious why infants bother to 
attend to the number of items they see or hear. But it looks as if human 
infants come prepared to learn quickly about objects and certain concepts, 
including number. These early competences provide a base from which 
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much natural learning proceeds during the preschool ye?rs. Acquisition of 
knowledge in these natural domains is guided as much by the availability 
of implicit structures and principles that guide the child's active learning 
about the nature of objects and events, causes, number, etc. as by the 
availability of a supportive environment. 



Preschool thought and its development ?re much influenced by implicit 
knowledge of fundamental principles governing the determination and ma- 
nipulation of numbers, the character of physical causality, and the differ- 
ences between animate and inanimate objects. 

Number Many preschoolers spontaneously count collections soon after 
they learn to talk. Gelman and Gallistel (1978) propose that even these very 
young children make implicit use of some, if not all, of five principles of 
counting: (1) The tags used in counting must be placed in one-to-one cor- 
respondence with the items counted; (2) the tags must be drawn in order 
from a stably ordered list; (3) the last tag used represents the number in 
the set (cardinality); (4) the order in which the items are tagged is irrelevant; 
and (5) sets of arbitrary composition may be counted. What evidence is 
there for this view? 

First, counting behaviors in young children are systematic, even when 
they use nonstandard tags or orderings. For example, Gelman and Gallistel 
(1978) report a two-and-one-half year old who said "one, two" when 
counting a two-item array and "one, two, six" when counting a three-item 
array (the one-one principle). The same child used her own list over and 
over again (stable order principle) and repeated her last tag when asked 
how many items she had (the cardinality principle). Such nonstandard lists 
in counting are like the systematic errors made by young language learne* 
(e.g., "I runned"); just as the occurrence of such language errors implies 
use of language rules by the very young, so the occurrence of stable non- 
standard lists can be taken as evidence of implicit counting principles. 
Further evidence for implicit counting principles is found in the fact that 
young children spontaneously self-correct their own and others' counting 
errors (Gelman and Meek, 1983) and often are inclined to count without 
any request to do so. Such behaviors point to a representation that monitors 
and motivates performance (Greeno et al., 1984). 

Other studies have shown that preschool * ildren solve simple arithmetic 
problems by using cot ting strategies they invent (Groen and Resnick, 



PRESCHOOL THOUGHT 



Principles About Numbers, Causes, and Oojects 



ISO 



CHANGING VIEWS OF COGNITIVE COMPETENCE IN WE YOUNG 



133 



1977; Siegler and Robinson, 1982). To illustrate, Groen and Resnick (1977) 
taught four- and five-year-old children to solve addition problems of the 
form x + y = ? by counting out x blocks, counting out y more blocks 
and then counting the combined set. Children who practiced their addition 
over several weeks got better. More surprising is that over half of them 
invented a better way of solving the problems; counting on from whichever 
was the larger of the two values in the problem. To account for such 
inventions, it is necessary to postulate the use of something like an implicit 
principle of commutativity. 

Finally, the preschool child also understands that addition and subtrac- 
tion, unlike displacement, rearrangement, or item substitution, alter nu- 
merosity. This has been shown in 4 'magic" experiments where a child is 
confronted with unexpected alterations in the sets used in a kind of shell 
game (Gelman, 1977). In these experiments children between the ages of 
three and five first learn to find plates holding different numbers of objects, 
e.g., two and three, underneath each of two cans. Then they discover 
surreptitious changes in the number, type, or arrangement of items in one 
array. Those children who encountered irrelevant changes deemed them 
such and those who encountered the effects of relevant transformations 
pronounced them relevant. For example, changes in number elicited con- 
siderable surprise, e.g., "Eeeeee, how did that happen?" Further, the 
children postulated the relevant transformation, e.g., "One gone — Jesus 
Christ came and took it." They also could indicate what number they 
expected, what number they actually ercountered, and what arithmetic 
operation would have to be performed to "fix" the game — in this case, 
addition. 

Hence we see early implicit understanding of number, addition, and 
subtraction. We will later ask why this competence d^cs not guarantee easy 
learning of mathematics in school. 

Causality The suggestion that young children work with implicit notions 
of cause will surprise those familiar with Piaget's work on the development 
of die child's conception of causality. In one set of inquiries, Piaget asked 
children to explain a variety of natural and mechanical phenomena, e.g., 
the cycle of the moon, floating objects, the movement of clouds, the op- 
eration of steam engines, and bicycles, etc. Analysis of the explanations 
led Piaget to characterize the young child's thought as fundamentally pre- 
causal. He wrote, "Immediacy of relations and absence of intermediaries 
... are the two outstanding features of causality around the age of four- 
five" (Piaget, 1980:268). 

Piaget's conclusion that a concern for mechanism is completely lacking 
in the preschooler is contradicted by several later lines of experimental 
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research. For example, Shultz (1982) showed two-year-olds the cause-effect 
sequence of turning on a blower that then blew out a candle. He then 
showed them two blowers, each surrounded on three sides by a plexiglass 
shield, the critical difference being whether the open side was facing the 
candle. If considerations of mechanism did n.it influence young children, 
they would choose randomly between the two blowers a> potential causes 
of extinguishing the candle. Instead they systematicJly ;hose the unblocked 
blower. Similar findings were reported for the transmission of sound from 
a toning fork o. light from a oattrry. Preschoolers co; intently took note 
of barriers that w^uld stop 'lie transmission of prerequisite energy. Com- 
parable results were ootained by Shultz with schooled and unschooled Mali 
children ; n West Africa. 

Other lines of evidence support the conclusion that preschoolers on many 
occasions do reveal an implicit concern for cause. Hood and Bloom (1979) 
note a ubiquitous tendency for children to seek causal accounts of what 
happens a-.d how things work. Bullock (in press) showed that young children 
distinguish plausible from implausible mechanisms. At the start of her 
experiment, a rolling steel ball and a rolling light (produced as is the moving 
light effect in z movie marquee) moved simultaneously down parallel run- 
ways and disappeared together at the same time into an adjoining box. 
After a brief delay Snoopy jumped out of the box. Children were asked ;> 
identify the cause of Snoopy 's jumping. They reliably named the ball. Since 
both preceding events were coterminous and redundant, the children should 
have shown no preference for one event over the other if they considered 
only temporal and spatial contiguity when reasoning about causality. Their 
preference for the ball (an object with momentum and kinetic energy) can 
be taken as evidence that they were concerned with plausibility of mech- 
anism. 

Findings like these have led many (e.g., Bullock et al., 1982; Koslowski 
et al., 1981; Shdtz, 1°82) to a view that preschool children work with a 
set of implicit assumptions about physical causality, including the crucial 
one that mechanisms mediate cause and effect relations. Guided by these 
implicit assumptions, they learn rapidly about their world; but, as we shall 
see, this does not guarantee the acquisition of scientifically correct theories. 

Objects An early concern for mechanism may explain why preschool 
children are able to separate animate and inanimate objects (Carey, 1985a; 
Geiman et al., 1983; Keil, 1979). For example, three- to five-year-olds, 
asked whether a rock, a doll, and a person could walk, typically answered 
that a rock cannot walk because it has no feet; that a doll cannot walk unless 
someone pushes it, because its feet are only pretend; and that people can 
walk by themselves. In other words, inanimate objects cannot cause them- 
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selves to move, whereas animate ones can. Even infants treat animate and 
inanimate objects as belonging to different categories; they become very 
upset when a human stands still and fails to respond to them (e.g., Field, 
1978; Tronick et al., 1975) but do not do so in the presence of inanimate 
objects. These findings have led theorists to postulate that humans are 
disposed to treat animate and inanimate objects separately at birth. Since 
infants respond differently to moving malleable objects than moving solid 
objects, this may reflect early recognition of fundamental differences in the 
way animate and inanimate objects move. 



This review of preschoolers' knowledge of numbers, causes, and objects 
only scratches the surface of evidence that the young are more competent 
than we once presumed. For example, there is compelling evidence that 
preschoolers' interest in and recall of stories reflects the availability of story 
grammars (Mandler, 1983; Stein and Trabasso, 1982); that preschoolers 
can systematically classify (Rosch et al., 1976); that they can be logical 
(Braine and Rumain, 1983); that they represent knowledge with a variety 
of coherent structures (Keil, 1981; Markman, 1981; Nelson and Gruendel, 
1981); that children this age can take account of the perspective of an 
observer other than themselves (Lempers et al., 1977; Shatz, 1978); and 
even that congenitally blind children have Euclidean representations of 
space (Landau et al., 1981). 

Given all this evidence, it should not be surprising to discover that 
preschool children are often strategic and planful when acquiring knowledge 
structures. This was not always assumed, however. The dominant devel- 
opmental theories of the 1960s argued that a major shift in the quality of 
children's learning occurred between the ages of five and seven years; prior 
to that shift, children's learning was seen as primarily nonstrategic, passive, 
and context dependent; only after the shift was children's learning thought 
to become increasingly strategic, active, and flexible. 

These ideas were not advanced in the abstract. An enormous empirical 
base backed them up (Stevenson, 1970; White, 1965, 1970). but much of 
this data base was built using experimental designs that were not suitable 
for young children. Given cognitive exercises designed for school-age stu- 
dents, preschoolers typically performed abysmally, if at all, thus confirming 
theoretical claims of their incompetence. 

Systematic attempts to find more suitable ways to test the competence 
of younger children began in the 1970s (for a discussion see Brown and 
Deloache, 1978; Donaldson, 1978; Gelman, 1978). We will present some 
selections from the growing evidence that preschool children behave stra- 
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tegically, often direct their own learning, and actively create, test, and 
refine their own theories of the world around them. Driving this reassess- 
ment of preschool learning has been a greater consideration of the context 
in which it is observed. 

Strategies for Remembering A great deal of research in the 1960s and 
early 1970s was concerned with the development of school-age children's 
strategies for enhancing memory (Brown, 1975; Flavell, 1970a, 1970b; 
Kail and Hagen, 1977). A central theme was that preschool children 'vould 
differ from grade-school children on tasks that demand a great deal of 
strategic ingenuity. Young children, failing to devise strategic plans, would 
be at a considerable disadvantage on tests of deliberate memory, whereas 
older children woulc display increasing competence, primarily because they 
deploy more and more effective learning strategies. But laboratory and 
school tests of deliberate memory do not translate readily into the contexts 
in which young children naturally practice their emergent retention skills. 
That four-year-olds tend to be at a loss when asked to reproduce lists of 
digits or letters does not mean that they completely lack the ability to plan 
for future memory demands. 

How, for example, could one reconcile the diagnosis of nonstrategic 
learning with the following description of three-year-olds anticipating a 
memory test? Surreptitiously observing children as they attempted to re- 
member which of several containers concealed a toy dog, Wellman et al. 
(1975) found clear evidence of rehearsal (looking at the target container 
and nodding yes, looking at the nontarget containers and nodding no), 
retrieval cueing (resting their hands on the correct container or moving it 
to a salient position), and focused attention (looking fixedly at the correct 
hiding place). The children refused to be distracted until they were permitted 
to retrieve the lost dog. These efforts were rewarded: children who prepared 
actively for retrieval did remember better. 

Dei^oache et al. (1985) found even earlier evidence of planning for future 
retrieval. Children 18 to 24 months old were observed playing a hide-and- 
seek game; an attractive toy (Big Bird) was hidden in a variety of locations 
in a laboratory waiting room, such as behind a pillow on a couch. A timer 
was set to indicate the retrieval interval of, for example, five minutes; when 
the bell rang, the child could retrieve the toy. Far from waiting passively, 
the children interrupted their play to engage in activities indicating they 
were still preoccupied with the memory task: talking about the toy, pointing 
to the hiding place, or attempting an illegal peek. The children did not 
engage in these "keep-alive" activities if the toy remained partially visible 
during the retention interval or if the experimenter was responsible for 
remembering the location. Many other examples of early strategic com- 
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petence could be cited; precocious strategic competence is not limited to 
attempts at remembering. 

Theory-building Typecasting the young child as a passive learner also 
leads to a view of the young as being dependent on others for opportunities 
to learn, most notably parents, peers, or teachers. Some have argued that 
much, if not most, cognitive growth is a result of children internalizing 
cognitive activities that they originally witness in others (Laboratory of 
Comparative Human Cognition, 1983; Rogoff and Wertsch, 1984). 

Interesting and important though guided learning situations may be, it is 
clear that much of the time children are also ;tively involved in orches- 
trating their own learning. Children learn in situations where there is no 
obvious guidance, no feedback other than their own satisfaction, and no 
external pressure to improve or change. They act like scientists, creating 
theories-in-action (Karmiloff-Smith, 1985) that they challenge, extend, and 
modify quite on their own. The child is not only a problem-solver but a 
problem-creatcr, a metaphor much in keeping with scientific thinking. 

Some of the best evidence of self-motivated learning comes from situ- 
ations in which children are observed as they operate on a problem, over 
considerable periods of time, quite without external pressure, seemingly 
with no motivation other than to improve the theory on which they are 
working. Consider the behavior of 24- to 48-month-old children engaged 
in free play with - set of nested cups (DeLoache, Sugarman, and Brown, 
in press). Although the children saw the cups nested before they began to 
play, there was no real need to renest them; however, they did so, working 
long and hard in the process. 

The most primitive activity, used frequently by children younger than 
30 months, was brute force. When a large cup was placed on a smaller 
one, the children would repeatedly twist, bang, or press down hard on the 
nonfitting cup. A second approach used by some of the younger children 
was that of local correction. After placing two nonfitting cups together, 
the child separated them and tried to find a replacement for only one cup, 
a minimal restructuring involving the relation between only two cups at a 
time. A third characteristic ploy of children younger than 30 months was 
to dismantle the entire set and start again whenever a cup did not fit. 

Older children (30 to 42 months) faced with a nonfitting cup engaged in 
strategies that involved consideration of the entire set of relations in the 
stack. For example, one sophisticated strategy was insertion; the children 
took apart the stack at a point that enabled them to insert a new cup correctly. 
A second strategy, reversal, was also shown by older children. After placing 
two nonfitting cups together, the child would immediately reverse the re- 
lation between them (5/4 immediately switched to 4/5). 
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The rapidly executed reversal strategy was not shown by the younger 
group. Some young children would repeatedly assemble, for example, cups 
4-1, starting with 4 as a base and then inserting 3, 2, I. Then they en- 
countered the largest cup, that is, 5, and attempted to insert it on top of 
the completed partial stack, pressing and twisting repeatedly. When brute 
force failed, they would dismantle the whole stack and start again. Similarly, 
having assembled I, 2, 4, and 5, and then encountering 3, the younger 
children's only recoarse was to begin again. 

The young learners progress from piecemeal activities and local fixup 
ploys to a thoughtful consideration of the relation among elements of the 
whole problem. There is evidence that this progression reflects a general 
learning mechanism in action that children of many ages use when faced 
with novel construction problems. A similar progression is seen in older 
(four to seven years) children attempting to construct a railway circuit 
(Karmiloff-Smith, 1979) and even in adolescents refining the processes of 
written composition (Scardamalia, 1984). It is also important to note that 
the development 0 n any one task is not completely age-governed in that 
children left to work on the problem over short periods of time (hours, 
days, etc.) show the same developmental progression from immature to 
mature activities that characterizes the cross-age descriptions of initial at- 
tacks on the problem (Brown, Kane, and DeLoache, work in progress; 
Karmiloff-Smith and Inhelder, 1974-1975). 

In most of the above examples, children left to work with the problem 
unaided create solutions, modify their own answers, correct their errors, 
and develop more mature strategies on their own. Perhaps more impressive 
cases are those in which children persist after an adequate solution has been 
reached. Reorganization and improvement in strategies is not solely a re- 
sponse to failure, but often occurs when the child seeks to improve quite 
adequate functioning procedures. In these cases, it is not failure that directs 
change but success that the child wishes to refine and extend. 

Consider, for example, the group of four- to seven-year-olds who were 
asked to balance rectangular wooden blocks on a narrow metal rod (Kar- 
miloff-Smith and Inhelder, 1974-1975). These were no ordinary blocks, 
however. Standard blocks had their weight evenly distributed, and could 
therefore be balanced at the geometric center. Weighted blocks had the 
weight of each * * side' 9 varied either conspicuously (by gluing a large square 
block to one end of the base rectangular block) or inconspicuously (by 
inserting a hidden weight into a cavity on one end); the geometric center 
rule would not work for these blocks. 

At first, the children made the blocks balance by brute trial and error. 
This ploy was obviously successful; the children balanced each block in 
turn. This early errorless but unanalyzed phase was spontaneously sup- 
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planted by the emergence of strong theories-in-action directed at uncovering 
the rules governing balance in the miniature world of these particular blocks. 
Unfortunately, they were incomplete hypotheses that produced errors. A 
common early theory was to concentrate exclusively on the geometric center 
and attempt to balance all blocks in this fashion. This works only for 
standard blocks; the weighted blocks were discarded as exceptions ( 4 'im- 
possible to balance"), even though the child had previously balanced them 
all. 

After this theory was well established, the child became discomfited by 
the number and regularity of errors. A new juxtaposed theory was then 
developed for conspicuously weighted blocks. For these, the children com- 
pensated for the weight that was obviously added to one end and adjusted 
the point of balance accordingly. For a time, however, length and weight 
were considered independently; standard blocks were balanced by the geo- 
metric center rule and conspicuously weighted blocks by the rule of "es- 
timate weight first and then compensate." Hidden weight problems still 
generated errors; these blocks looked identical to the standard ones and 
were therefore subjected to the geometric center rule; when Uiey did not 
conform, they were discarded as anomalies, "impossible to balance." 

Now the young theorists were made uncomfortable by the remaining 
exceptions and began to seek a rule for them. In so doing, a reorganization 
was induced that resulted in a single rule for all blocks. The children paused 
before balancing any block and roughly assessed the point of balance. Verbal 
responses reflected their consideration of both length and weight, e.g., 
"You have to be careful, sometimes it's just as heavy on each side and so 
the middle is right, and sometimes it's heavier on one side." After inferring 
the probable point of balance, and only then, did the child place the block 
on the bar. 

For all of these examples we can ask, why do children bother? Implicit 
in the situation is the goal that the cups should be nested, the railway 
constructed, or the blocks balanced; but the children are free to abandon 
their efforts whenever they like. They persist, however, for long periods, 
even in the face of frustration and even when an adequate partial solution 
has been reached. 

Pressure to work on adequate partial theories, to produce more encom- 
passing theories, is very similar to what occurs in scientific reasoning. Like 
the scientist, it is essential that the child first develop simple theories that 
they perfect and control before they entertain more encompassing complex 
hypotheses. Karmiloff-Smith and Inhelder refer to this as creative simpli- 
fication. By ignoring some of the complicating factors initially, the child 
can begin to construct theories that achieve partial success. Progress comes 
only when the inadequate partial theory is well established and the learner 
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attempts to extend the theory to other phenomena. In this way, children or 
scientists are able to discover new properties that in turn make it possible 
for new theories to be constructed. 

Summary 

The studies reviewed in this section make it clear that a universal di- 
agnosis of young children as passive learners, with little control of their 
own cognitive growth, does serious injustice to their ingenuity. Faced with 
problems to solve, where they are interested in the outcome and understand 
the goal, even two-year-oids behave like scientists, actively exploring the 
environment, testing theories in action, and modifying approaches to prob- 
lems as a result of experience. 

This is not to claim that two-year-olds possess problem-solving abilities 
comparable to those of the adult, or even of the eight-year-old. Nor is it 
to claim that preschool theory building is comparable to scientific reasoning 
perfected during the adolescent, college, and later years. Precursors of 
active, systematic problem solving emerge early in the child's life, but there 
are limits on the young child's theory building, and they can have consid- 
erable difficulty harnessing their natural proclivities in settings of formal 
education. These matters are taken up in the fourth section of this essay. 

THE TRANSITION TO FORMAL SCHOOLING 

Incomplete Knowledge 

Young children develop and test theories about the nature of objects, 
numbers, causality, etc., but these theories are implicit, partial, limited, 
and sometimes wrong. Their further development depends to some extent 
on the kind of structured input offered in school, input that makes these 
theories more precise and explicit. 

For example, young children sense that animate and inanimate objects 
differ, but a great deal more knowledge is needed to develop organized 
biological theories. Preschool children do not think of animals in the same 
way that older children and adults in this culture do, i.e., as sharing certain 
defining biological characteristics (Carey, 1985b). For example, if pre- 
school children are taught that a person has a stomach, they allow that other 
animals do as well, but inanimate objects do not. However, if they are 
instead taught that a dog has a stomach, they do not necessarily attribute 
this to people and other animals. Carey postulates that young children's 
theory of animates is based on their theory of people and only later on a 
biological theory. 
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In a similar vein, preschoolers 9 understanding of number extends to 
some but not all situations. For preschoolers, a number is what you get 
by counting objects in a set. This is a good theory to a degree, but it 
has limits; for example, it seems to hinder tne expansion of the children's 
number system to include zero and negative numbers. Zero is not a tag 
that one applies to an item in a set being counted; nor of course is minus 
one (Evans, 1983). 

Misconceptions about the centrality of counting must contribute to sys- 
tematic errors that elementary school children make in subtraction problems: 
they arc strongly inclined to subtract the smaller digit from the larger, and 
carrying across zero presents children with unusual difficulty (Brown and 
VanLehn, 1982; LindvaU and Ibarra, 1981). 

Another characteristic difference between the theories of the young and 
older individuals is in their explicitness. The theories entertained by pre- 
schoolers are almost always implicit; they cannot be articulated but never- 
theless seem to determine beliefs and actions in a given domain. To illustrate 
die power of an implicit theory: All English speakers have extensive implicit 
knowledge of English syntax, knowledge that constrains what we say and 
understand, but that, in the absence of linguistic instruction, we cannot 
articulate. No one ever says: "Who d»d John see Mary and?" but few can 
articulate the principle of syntax that this utterance violates. 

Education often serves to teach the child to make implicit theories explicit. 
But some theories that the learner is asked to master explicitly conflict with 
existing implicit theories. For example, theories of mechanics developed 
early in life may interfere with the lining of formal theories of mechanics. 
Infants, as we have seen, initiate hand movements to intercept objects on 
the implicit assumption that they will continue along curvilinear trajectories. 
McCloskey (1983) suggests that such extrapolations could contribute to a 
medieval impetus theory of moving objects. There is evidence that students 
even in high school and college have trouble assimilating Newtonian me- 
chanics because they convert what they are taught into something like the 
impetus theory. For example, a student who had completed college physics, 
asked to define momentum, replied: "... A combination of the velocity 
and the mass of an object. It's something that keeps a body moving." 
Clearly, he had not grasped the concept of inertia, but like the child or the 
medieval physicist, considered that some force is always required to keep 
an object moving at constant velocity. 

The spontaneous development of early knowledge structures makes it 
possible for infants and very young children to acquire rapidly a functional 
understanding of the world. Yet, these early theories can be two-edged 
swords; they may sometimes impede children's understanding of explicit 
theories encountered in the context of formal schooling. Knowing this, we 
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are in a better position to understand some of the problems children have 
in school. 



The Expansion of Strategic Powers 

Young children have considerable strategic skill. Still, they have a long 
way to go in meeting the demand* of literacy. Three-year-olds keeping 
alive their memory of a hidden toy seem to grasp the rudiments of rehearsal, 
but this does not mean that they know how to rehearse in a manner that 
would assist them in learning to spell, or remember historical facts or 
complex logical or mathematical relations. Gradual refinement and tuning 
of skills, together with a growing understanding of their function and range 
of utility, typifies the evolution of many school-relevant learning strategies. 
An example is skill at learning word lists. Two-year-olds display primitive 
precursors of rehearsal in their attempts to maintain memory of an object 
by naming, pointing, or eye fixation (DeLoache et al., 1985). By five years 
of age, children attempt to name (label) some of the items in a set some 
of the time (Flavell et al., 1966). Labeling and rote repetition of single 
items become well established during the early grade-school years (Craik 
and Watkins, 1973). With increasing sophistication, children then begin to 
place more items in their rehearsal sets, engaging in "cumulative re- 
hearsal." During the later primary and early secondary years there is con- 
tinual refinement of cumulative rehearsal, such as coordinating acquisition 
and retrieval components and increasingly attending to the size and com- 
position of rehearsal sets (Belmont and Butterfield, 1977). Adolescents use 
elaborated rehearsal: they become increasingly sensitive to the presence of 
conceptual organization in the to-be-remembered list and capitalize on this 
inherent structure whenever possible (Ornstein and Naus, 1978), a devel- 
opment necessary to moving from rehearsal of lists of items and paired 
associates (as in spelling and foreign language leaning) to the learning of 
whole segments of text. Adequate rehearsal strategies for studying do not 
appear until well into the high school years and are not perfected even by 
college students (Brown et al., 1983). 

Memory strategies are not the only forms of school-related learning that 
evolve gradually. Literacy also demands skills of exposition and commu- 
nication far beyond those expected of the preliterate child. Although young 
children can take their listeners' knowledge, perspective, and communi- 
cative competence into account when attempting to relay a simple message, 
schools demand much greater sophistication. The student is often required 
to communicate hazily understood material to an audience that does not 
share the same background knowledge and assumptions. Schools eventually 
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require that the student communicate in writing to an unseen, often unknown 
audience, remote in time and space. 

It is useful then to think of competence in terms of bandwidths, the lower 
end defined by the spontaneous learning of early childhood, the upper end 
defined by the- ever increasing demands of the literate and technological 
society served by the schools (Brown and Rc-ve, 1985). Early competence 
emerges in hospitable contexts that match well with the child's knowledge, 
interests, and goals: here we see the "tireless explorer" and the "knowledge 
seeker" (Chukovsky, 1971) in action, the "littV scientist" coming to un- 
derstand his world. In schools, however, the goals and contexts of learning 
cannot always be of the child's choosing. The goal of learning through 
spontaneous discovery cannot always be maintained, and students must 
require skills of learning for learning's sake. By its very nature, much of 
schooling must be divorced from the simple, readily understandable goals 
of play or work (Bruncr, 1972). Formal learning demands that students 
acquire knowledge without context, and even the preferred structuring of 
knowledge in temporal, spatial scripts, or story form, must be waived in 
favor of academic forms of organization by hierarchy and taxonomy (Man- 
dler, 1983). It should not be surprising that many children's natural learning 
proclivities are overwhelmed by the task of acquiring large amounts of 
decontextudized material, organized in nonpreferred modes, with demands 
for precision and processing capacity greater than is the esse in everyday 
life (Bartlett, 1958). 

School learners not only must acquire knowledge in specific domains, 
such as science and history, but they must also "learn how to learn," 
developing routines for studying in general. More than ever before, schools 
must equip people to deal with facts that they will encounter only after they 
leave school. In a scientific and technological society based on an increas- 
ingly complex and rapidly changing information base, a productive member 
of society must be able to acquire new facts, critically evaluate them, and 
adapt to their implications. Schools need to develop intelligent novices 
(Brown et al., 1983), those who, although they may not possess the back- 
ground knowledge needed in a new field, know how to go about gaining 
that knowledge. 

Formal and Informal Teaching 

It is not only the type of material to be learned that shifts in the school 
setting. There is also a substantial change in the teaching procedures com- 
pared with informal settings such as homes, preschools, or special interest 
clubs. In many cultures children are initiated inO adult work activities and 
literacy events without explicit formal instruction. Opportunities for learning 
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schooling by becoming somewhat passive learners. Habitual failure in ac- 
ademic settings erodes their feelings of personal competence. The additional 
burden of repeated evaluation and labeling that accompanies continued 
failure is even more damaging. Such children develop quite devastating 
diagnoses of their own capabilities, readily describing themselves as "dumb," 
"not good at school things," "too stupid to read," etc. They come to 
question their personal efficacy (Bandura, 1980) in school settings. Children 
who view themselves as inadequate in school, as nonstarters in the academic 
race, often develop compensatory coping strategies to preserve their feelings 
of self-worth in what they view as die less-than-hospitable environment of 
the classroom. 

Negative conceptions of one's prognosis for school success lead at best 
lodefensive "passing," "coping," or "managing" (Goffman, 1963). Cop- 
ing strategies include systematic devaluation of academic tasks and goals 
and the justification of lack of effort, i.e., "who needs to read anyways." 
Passing and managing tactics can be perfected so that the wily child avoids 
occasions of challenge. Threatening tests can be avoided if other children 
will cover; teachers wil? avoid embarrassment by not calling on the weaker 
child (Cole and Traupmann, 1980). All these ployj serve to defend against 
damaging expositions, attributions of failure, and further erosion of self- 
efficacy. These defenses are also formidable barriers to learning. Orienting 
one's attention and effort in school to minimizing demonstrations of failure 
rather than actively seeking occasions for acquiring new knowledge may 
be a realistic reaction to repeated obstacles, but it is not conducive to new 
learning. 

Failure-oriented children typically display a pattern of learned helpless- 
ness in the face of obstacles or errors (Seligman et al., 1971). This pattern 
also increases negative feelings and further deflates the prognosis for suc- 
cess. There is a concomitant degradation of learning strategies. Failure- 
oriented children attribute their errors to lack of ability and often view 
temporary failure as an indication of a stable, generalized incompetence 
("I'm dumb."). Helpless children question their ability in the face of 
obstacles, perceiving past successes to be few and irrelevant and future 
effort to be futile (Dweck and Bempechat, 1983). 

In contrast, mastery-oriented children treat obstacles as challenges to be 
overcome by perfecting one's learning strategies; they do not attribute a 
temporary setback to personal shortcomings. Their verbalizations following 
failure often consist of positive self-instruction: "Slow down," "try new 
tactics," "evaluate the task more systematically." Dweck and Bempschat 
(1983) argue that these different reactions to academic difficulties reflect 
whether the child conceives tasks in terms of performance goals, where 
competence is to be evaluated and perhaps fcand wanting, or learning goals, 
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where an opportunity exists to acquire new competences. Performance-goal 
children feel that they have been successful when they "don't make mis- 
takes," "get easy work/' etc. whereas learning-goal children feel suc- 
cessful when they master a new skill. 

Reawakening the Active Learner 

V be successful interventions with passive learners must reinstill the 
confidence necessary for self-directed learning. Wertime (1979) has argued 
that many students need help to increase their "courage spans," enabling 
them to treat failures as false starts or blind alleys that can be overcome 
and to regard errors as useful information. Students need to tolerate am- 
biguity, evaluate and judge information, and seek disaffirming evidence — 
in short, become critics and especially self-critics (Binet, 1909; Brown, 
1985). But this criticism must be constructive, mastery-oriented self- 
guidance rather than self-derogation. 

To end on a optimistic note we will illustrate two methods that have 
achieved some success at acclimatizing children to formal learning settings: 
(a) avoiding initial failures by adapting early school experiences to the prior 
competence of the entering child; and (b) lessening the gap between in- 
formal and formal teaching styles. 

An excellent example of matching classrooms to homes is Heath's work 
with poor black Appalachian kindergarten children entering classrooms of 
white middle-class teachers (Heath, 1981). Heath found systematic differ- 
ences between questioning behavior in the black and white communities 
she studied, particularly a mismatch between classroom questioning routines 
and spontaneous questioning activities in black preschoolers' environments. 
A common classroom routine is the "known-answer" question. Teachers 
routinely call on children to answer questions in order to display the chil- 
dren's knowledge rather than to provide information that the teacher does 
not have, which is the more familiar purpose of a question . These classroom 
questioning patterns do not map well into the earlier experiences of many 
children who lack informal exposure to academic language games. 

At the beginning of the study Heath found that teachers were bewildered 
by the lack of responsiveness of their black pupils. For example: "They 
don't seem to be able to answer even the simplest questions." "I would 
almost think they have a hearing problem; it's as if they don't hear me ask 
a question. " "I sometimes feel that when I look at them and ask a question, 
I'm staring at a wall I can't break through" (Heath, 1981:108). 

Heath shared with teachers her documentation of the types of preschool 
questioning these children were familiar with, such as metaphoric and 
narrative sequences, and encouraged them to engineer settings that evoked 
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the children's competence in the familiar format. Having practiced familiar 
questioning rituals, the teachers were then able to introduce the unfamiliar 
known-answer routines with great success. Another case of easing the 
transition to formal schooling, by capitalizing on the children's strengths 
rather than exposing their weaknesses, is the remarkable gains in reading 
achievemeut shown by Native Hawaiian (Polynesian) children after reading 
lessons were set in the context of a familiar Hawaiian interactive game, 
"talk-story" (Au, 1980). 

Anouer successful intervention ploy is to lessen me gap between informal 
and formal learning settings. As we have seen, natural tutoring involves 
modeling on the part of the teacher and a gradual transfer of responsibility 
to the novices when and if they are ready to take control of their own 
learning. Instructional routines that mimic natural tutoring sessions are 
proving quite successful For example, junior high school "passive" learn- 
ers with depressed reading comprehension scores were moved from tradi- 
tional instruction to a reciprocal teaching environment based on theories of 
natural tutoring. In reciprocal teaching, students of varying levels of com- 
petence and an adult teacher take turns "being the teacher," that is, leading 
a dialogue on a segment of text they are jointly attempting to understand 
and remember. The teacher responsible for a particular segment of text 
leads the ensuing dialogue by stating the gist i*i his or her own words, 
posing a question, clarifying any misunderstandings, and predicting what 
might happen next. All of these activities are part of a natural dialogue 
between the adult teacher and students. If a student has difficulty with any 
component of the dialogue, the teacher provides modeling and feedback at 
the student's current level, gradually leading each student to independent 
competence. Examples of such gradual transfer of responsibility can be 
found in Palincsar and Brown (1984). 

Reciprocal teaching is based <r certain central principles of effective 
learning: (1) the teacher models the desired comprehension activities, thereby 
making underlying processes overt, explicit, and concrete; (2) the teacher 
demonstrates the activities in appropriate contexts, not as isolated decon- 
textualized skills; (3) the students are fully informed of the need for strategic 
intervention and the range of utility of a particular strategy; (4) the students 
see immediately that the use of strategies works for them; (5) the respon- 
sibility for the comprehension activities is transferred to the students as 
soon as they can take charge of their own learning; (6) this transfer of 
responsibility is gradual, presenting students with a comfortable challenge; 
and (7) feedback is tailored to the students' existing levels, encouraging 
them to progress one more step toward competence. 

The reciprocal teaching procedure involves continuous trial and error on 
the part of the student, coupled with continuous adjustment on the part of 
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the teacher to the student's current competence. Through interactions 
with the supportive teacher and their more knowledgeable peers, the 
students are led to perform at an increasingly more mature level; some- 
times this progress is fast, sometimes slow, but, irrespective of the rate, 
the teacher provides an opportunity for the students to respond at a slightly 
challenging level. As the students master one level of involvement, the 
teacher increases his demands so that the students are gradually called 
upon to adopt the adult role fully and independently. The teacher then 
fades into the background as the students take charge of their own learning 
from texts. 

The results of the reciprocal teaching intervention with junior high school- 
ers were dramatic. The students improved their ability to clarify, predict, 
summarize, and ask questions. Consider the quality of the summaries; these 
seventh-grade students initially produced summaries ranked inadequate even 
by the standards set by fifth graders. At the end of two weeks of daily 
reciprocal teaching sessions, they were able to produce quite acceptable 
inventions, i.e., summaries, in their own words, of the gist of a particular 
dialogue. A predominance of inventions characterizes the untrained sum- 
marization performance of college freshmen (Brown and Day, 1983). Thus, 
guided instruction had taken these failing seventh graders to a level of 
competence far beyond that typical *br their peers. Furthermore, they also 
became able to assume the role of teacher, producing their own questions 
and summaries and evaluating those of others. In addition, there were 
significant improvements in independent performance on laboratory, class- 
room, and standardized tests of comprehension. But perhaps more impor- 
tantly, the children's feelings of personal competence and control improved 
dramatically. Allowed to take charge of the dialogues, and even tutor less 
advanced students, these "failing" students increased their courage as well 
as their purely cognitive skills. Success bred positive expectations from 
teachers and improved students' personal "efficacy," i.e,, the confidence 
to employ active learning strategies in the belief that they will woric. 

It is important to note that mimicking natural tutoring styles has proved 
a successful instructional technique in areas other than reading: listening 
comprehension (Brown and Palincsar, in press), writing (Applebee and 
Langer, 1983;Scaniamalia, 1984), storytelling (McNamee, 1981), studying 
(Frase and Schwartz, 1976), and problem solving (Bloom and Broder, 1950) 
have all responded well to reciprocal instruction strategies. In addition, it 
is not only teachers who can serve as the agent of change but also mothers 
(NinioandBruner, 1978; Saxeetal., 1984; Scollon, 1976; Wertsch, 1979), 
peers (Bloom and Broder, 1950; Whimbey and Lochhead, 1982), and even 
somewhat intelligent computer tutors (Brown et al., 1982; Heller and Hun- 
gate, 1984; Lesgold and Reif, 1983). The concept of expert scaffolding, 
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tl i gradually guided transfer of learning responsibility from an expert to a 
novice, has wide applicability as an instructional philosophy. 



Recognition of children's natural competence, both in terms of strategic 
rules and knowledge, is having a profound effect on instructional theory. 
Structured instruction, however, is necessary for the child to go beyond 
imprecise, and sometimes erroneous, implicit theories and to acquire the 
precise, explicit theories that constitute formal knowledge. Through the 
intervention of certain forms of formal schooling, children are turned into 
routine school experts (Hatano, 1982), able to perform, more and more 
efficiently, the procedures taught and practiced in schools. 

One problem, however, is that routine expertise can lead to the requisition 
of "inert knowledge" (Whitehead, 1916), acquired by rote learning and 
practice but rarely used flexibly and creatively. Educational systems that 
promote adaptive expertise (Hatano, 1982), whereby students come to un- 
derstand, challenge, and flexibly apply their knowledge, depend on main- 
taining the active thirst for knowledge that the preschool child brings initially 
into settings of formal education. The more we learn about the knowledge 
structures that children bring to school and the instructional practices that 
foster their natural proclivities to build and refine theories, the more able 
we will be to design instructional modes that promote adaptive expertise 
rather than the acquisition of inert knowledge. 



In this chapter we have concentrated on an apparent pr-^ox concerning 
the cognitive competence of children. Recent research wL* miants and very 
young children suggests that they know far more about their world initially, 
and develop this understanding more rapidly, than was previously supposed. 
However, topical consternation ove; the putatively increasing incompetence 
of school-aged children in academic settings stands in sharp contrast to 
these claims of early ingenuity. 

In the first part of the chapter, we discussed the necessity of granting 
complex cognitive structures to the young human mind. This breaking away 
from an empiricist account of human thought took its impetus from sweeping 
changes in psychological theory pioneered notably by Chomsky, the Gib- 
sons, and Piaget. Buttressing these theoretical claims is a body of contem- 
porary research gleaned from a variety of ingenious techniques that make 
it increasingly feasible to interrogate infants. The outcome of a painstaking 
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set of inquiries is a window through which we can view the young child's 
cognitive world, a window that is only beginning tc )p n . 

We now know that infants arc sensitive to certain ; .1 .es of movement 
early in life; that they garner mutiisensory and multimodal information about 
the nature of objects; that they endow objects with properties of rigidity and 
solidity; and that they possess rudimentary theories of categories, recog- 
nizing properties of sets of nonidentical objects, including numerosity, a 
property of sets divorced from any description of the objects themselves. 
Implicit principles of causality, numerosity, etc. guide the development of 
such knowledge at a rapid pace during the preschool years, a time during 
which chi'dren are busily engaged in exploring their environment. Char- 
acterized as "tireless explorers," they invent primitive but serviceable 
comprehension, learning, and memory strategies, and create and test con- 
tinously evolving theories to breathe meaning into their physical and social 
world. 

The pace of this development sesms to slow down during the school 
years, but this may be because children's competence is increasingly viewed 
in the light of their performance on academic tests. Learning in schools 
differs from natural learning in that others are in charge of what must be 
learned, others control the timetable, and students must develop interest 
and skill in learning for learning's sake so that they can intentionally set 
about acquiring large bodies of decontextualized knowledge. 

In an increasingly complex and rapidly changing technological society, 
more than ever before, students must be equipped to acquire new infor- 
mation, critically evaluate it, and adapt to its implications. They must learn 
to waive their imprecise theories in favor of the precise, explicit, more 
encompassing theories that constitute formal knowledge. Profound theory 
change of this magnitude comes at a cost that many may be reluctant to 
pay without a supportive academic environment. In the latter part of the 
chapter, we discussed innovative pedagogical procedures that serve to main- 
tain and bolster the child's natural curiosity and theory-building capacities. 
In the exploitation of such techniques lies hope for solving the paradox of 
early competence and later academic crisis. 
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on Language Behavior 



MICHAEL STUDDERT-KENNEDY 



INTRODUCTION 

Fifty years ago the study of language was largely a descriptive endeavor, 
grounded in the traditions of nineteenth century European philology. The 
object of study, as proposed Ly de Saussure in a famous course of lectures 
at the University of Geneva ( 1906- 19 1 1 ), was langue, language as a system, 
a cultural institution, rather than parole, language as spoken and heard by 
individuals. In 1933 historical linguists were describing and comparing the 
world's languages, tracing their family relations, and reconstructing the 
protolanguages from which they had sprung (Lehmann, 1973). Structural 
linguists were developing objective procedures for analyzing the sound 
patterns and syntax of a language, according to well-defined, systematic 
principles (e.g., Bloomfield, 1933). Students of dialect were applying such 
procedures to construct atlases of dialect geography (Kurath, 1939), while 
anthropological linguists were applying them to American Indian, African, 
Asian, Polynesian, and many other languages (Lehmann, 1973). The work 
goes on. From it we are coming to understand the origins of language 
diversity: not only how languages change over time and space but also how 
they and their dialects act as forces of social cohesion and differentiation 
(e.g., Labov, 1972). 

However, the unfolding of the descriptive tradition and the development 
of new methods and theories in the field of sociolinguistics are not my 
concerns in this chapter. My concern, rather, is with a view of language 
that has emerged from a more diverse tradition. For like the taxonomic 
studies of Linnaeus in botany and of his followers in zoology, the great 
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labor of language description and classification has provided the raw ma- 
terial for a broader science, stemming from the work of seventeenth century 
grammarians and of such nineteenth century figures as the German physicist 
Hermann von Helmholtz, the French neurologist Paul Broca, and the En- 
glish phonetician Henry Sweet. The several strands that their works rep- 
resent have come together over the past 30 to 40 years to form the basis 
of a new science of language, focusing on the individual, rather than on 
the social and cultural, linguistic system. Since the new focus is essentially 
biological, a biological analogy may be helpful. It is as though we shifted 
from describing and classifying the distinctive flight patterns of the world's 
eight or nine thousand species of birds to analyzing the basic principles of 
individual flight as they must be instantiated in the anatomy and physiology 
of every hummingbird and condor. Thus, this new science of language 
asks: What is language as a category of individual behavior? How does it 
differ from other systems of animal communication? What uo individuals 
know when they know a language? What cognitive, perceptual, and motor 
capacities must they have to speak, hear, and understand a language? How 
do these capacities derive from their biophysical structures, that is, from 
human anatomy and physiology? What is the course of their ontogenetic 
development? And so on. 

Such questions hardly fall within the province of a single discipline. 
The new field is markedly interdisciplinary and addresses questions of 
practical application as readily as questions of pure theory or knowledge. 
Linguistics, anthropology, psychology, biology, neuropsychology, neu- 
rology, and communications engineering all contribute to the field, and 
their research has implications for workers in many areas of social import: 
doctors and therapists treating stroke victims, surgeons operating on the 
brain, applied engineers working on human-machine communication, 
teachers of second languages, of reading, and of the deaf and otherwise 
language-handicapped. 

The origins of the new science are an object lesson in the interplay 
between basic and applied research, and between research and theory. To 
understand this, we must begin by briefly examining the nature of language 
and the properties that make it unique as a system of communication. 

The Structure of Language 

If we compare language with other animal communication systems, we 
are struck by its breadth of reference. The signals of other animals form a 
closed set with specific, invariant meanings (Wilson, 1975). The ultrasonic 
squeaks of a young lemming denote alarm; the swinging steps and lifted 
tail of the male baboon summon his troop to follow; the "song" of the 
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male white-crowned sparrow informs his fellows of his species, sex, local 
origin, personal identity, and readiness to breed or fight. Even the elaborate 
"dance" of the honeybee merely conveys information about the direction, 
distance, and quality of a nectar trove. But language can convey information 
about many mere matters than these. In fact, it is the peculiar property of 
language to set no limit on the meanings it can carry. 

How does language achieve this openness, or productivity? There are 
several key features to its design (Hockett, 1960). Here we note two. First, 
language is learned: it develops under the control of an open rather than a 
closed genetic program (Mayr, 1974). Transmission of the code from one 
generation to the next is therefore discontinuous; each individual recreates 
the system for himself. There is ample room here for creative variation — 
probably a central factor in the evolution of language and in the constant 
processes of change that all languages undergo (e.g., Kiparsky, 1968; 
Locke, 1983; Slobin, 1980). One incidental consequence of this freedom 
is that the universal properties of language (whatever they may be) arc 
largely masked by the surface variety of the several thousand languages, 
and their many dialects, now spoken in the world. 

Second, and more crucially, language has two hierarchically related levels 
of structure. One level, that of sound pattern, permits the growth of a large 
lexicon; the other level, that of syntax, permits the formation of an infinitely 
large set of utterances. A similar combinatorial principle underlies the 
structure of both levels. 

Consider, first, the fact that a six-year-old, middle class American child 
typically has a recognition vocabulary of some 8,000 root words, some 
14,000 words in all (Templin, 1957). Most of these have been learned in 
the previous four years, at a rate of about five or six roots a day. As an 
adult, the child may come to have a vocabulary of well over 150,000 words 
(Seashore and Erickson, 1940). How is it possible to produce and perceive 
so many distinct signals? 

The achievement evidently rests on the evolution in our hominid ancestors 
of a combinatorial principle by which a small set of meaningless elements 
(phone, ics, or consonants and vowels) is repeatedly sampled, and the sam- 
ples permuted, to form a very large set of meaningful elements (morphemes, 
words). Most languages have between 20 to 100 phonemes; English has 
about 40, depending on dialect. The phonemes themselves are formed from 
an even sm:!ler set of movements, or gestures, made by jaw, lips, tongue, 
velum (soft palate), and larynx. Thus, the combinatorial principle was a 
biologically unique development that provided ' 'a kind of impedance match 
between an open-ended set of meaningful symbols and a decidedly limited 
set of signaling devices 1 ' (Studdert-Kennedy and Lane, 1980; cf. Cooper, 
1972; Libermanetal., 1967). We may note, incidentally, thata large lexicon 
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is not peculiar to complex, literate societies: even s'-^IIed primitive human 
groups may deploy a considerable lexicon. Foi example, the Hanunoo, a 
stone age people of the Philippines, have nearly three thousand words for 
the flora and fauna of their world (Levi-Strauss, 1966). 

Of course, a large lexicon is not a language. Many languages have 
relatively small lexicons, and in everyday speech we may draw habitually 
on no more than a few thousand words (Miller, 1951). To put words to 
linguistic use, we must combine them in particular ways. Every language 
has a set of rules and devices, its syntax, for grouping words into phrases, 
clauses, and sentences. Among the various devices that a language may 
use for predicating properties of objects and events, and for specifying their 
relations (who does what to whom) are word order and inflection (case, 
gender, and number affixes for nouns, pronouns, adjectives; person, tense, 
mood, and voice affixes for verbs). An important distinction is also made 
in all languages between open-class words with distinct meanings (nouns, 
verbs, adjectives, etc.) and closed-class or function words (conjunctions, 
articles, verbal auxiliaries, enclitics— e.g., the particle "not" in "cannot") 
that have no fixed meaning in themselves but serve the purely syntactic 
function of indicating relations between words in a sentence or sequence 
of sentences. Here again then, a combinatorial principle is invoked: a finite 
set of rules and devices is repeatedly sampled and applied to produce an 
infinite set of utterances. 

I should note that many of the facts about language summarily described 
above are already framed from the new viewpoint that has developed in 
the past 40 years. Let us now turn back the clock and consider the early 
vicissitudes of three areas of applied research that contributed to this de- 
velopment. 

Three Areas of Applied Research in Language 

In the burst of technological enthusiasm that followed World War II, 
federal money flowed into three related areas of language study: automatic 
machine translation, automatic speech recognition, and automatic reading 
machines for the blind. A considerable research effort was mounted in all 
three ar';as during the late 1940s and early 1950s, but surprisingly little 
headway was made. The reason for this, as will become clear below, was 
that all three enterprises were launched under the shield of a behaviorist 
theory according to which complex behaviors could be properly described 
as chained sequences of stimuli and responses. 

The initial assumption underlying attempts at machine translation was 
that this task entailed little more than transposing words (or morphemes) 
from one language into another, following a simple left-to-right sequence. 
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If this were so, we might store a sizable lexicon of matched Russian, say, 
and English words in a computer and execute translation by instructing the 
computer to type out the English counterpart of each Russian word typed 
in. Unfortunately, both semantic and syntactic stumbling blocks lie in the 
path. The range of meanings, literal and metaphorical, that one language 
assigns to a word (say, English high, as in **high mountain," "high pitch,'* 
"high hopes," "high horse," "high-stepping," and "high on drugs") 
may be quite different from the range assigned by another language; and 
the particular meaning to be assigned will be determined by context, that 
is, by meanings already assigned to some in principle unspecifiable sequence 
of preceding words. Moreover, the syntactic devices for grouping words 
into phrases, phrases into clauses, and clauses into sentences may be quite 
different in different languages. This is strikingly obvious when wc compare 
a heavily inflected language, such as Russian, with a lightly inflected 
language with a more rigid word order, such as English. Oettinger (1972) 
amusingly illustrates the general difficulties with two simple sentences, 
immediately intelligible to an English speaker, but a source of knotty prob- 
lems in both phrase structure and word meaning to a computer, programmed 
for left-to-right lexical assignment: Time flies like an arrow, and Fruit flies 
t t Kt a banana. From such observations, it gradually became clear that we 
would make little progress in machine translation without a deeper under- 
standing of syntax and of its relation to meaning. 

The initial assumption underlying attempts at automatic speech recog- 
nition was similar to that for machine translation and equally in error (cf. 
Reddy, 1975). The assumption was that the task entailed little more than 
specifying the invariant acoustic properties associated with each consonant 
and vowel, in a simple left-to-right sequence. One would then construct an 
acoustic filter to pass those properties but no others, and control the ap- 
propriate key on a printer by means of the output from each filter. Unfor- 
tunately, stumbling blocks lie in this path also. A large body of research 
has demonstrated that speech is not a simple left-to-right sequence of discrete 
and invariant alphabetic segments, such as we see on a panted page (e.g., 
Fant, 1962; Joos, 1948; Liberuan et al., 1967). The reason for this, as we 
shall see shortly, is that we do not speak phoneme by phoneme, or even 
syllable by syllable. At each instant our articulators are engaged in executing 
patterns of m wement that correspond to several neighboring phonemes, 
including those in neighboring syllables. The result of this shingled pattern 
of movement is, of course, a shingled pattern of sound. Even more extreme 
variation may be found when we examine the acoustic structure of the same 
syllable spoken with different stress or at different rates or by different 
speakers. From such observations it gradually became clear that we would 
make little progress in automatic speech recognition without a deeper un- 
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derstanding of how the acoustic structure of the speech signal specifies the 
linguistic structure of the message. 

Finally, the initial assumption underlying attempts to construct a reading 
machine for the blind was closely related to that for automatic speech 
recognition and again in error (Cooper et al., 1984). A reading machine is 
a device that scans print and uses its contours to control an acoustic signal. 
It was supposed that, given an adequate device for optical recognition of 
letters on a page, one need only assign a distinctive auditory pattern to each 
letter, to be keyed by the optical reader and recorded on tape or played in 
real time to a listener — a sort of auditory Braille. Once again there were 
stumbling blocks, but this time they were perceptual. We normally speak 
and listen to English at a rate of some 150 words per minute (wpm), that 
is, roughly 5 to 6 syllables or 10 to 15 phonemes per second. Ten to 15 
discrete sounds per second is close to the resolving power of the ear (20 
elements per second merge perceptually into a low-pitched buzz). Not 
surprisingly, despite valiant and ingenious attempts to improve the acoustic 
array, even the most practiced listeners were unable to follow a substitute 
code at rates much beyond that of skilled Morse code receivers, namely 
some 10 to 15 words per minute — a rate intolerably slow for any extended 
use. From this work, it gradually became clear that the only acceptable 
output from a reading machine would be speech itself. This conclusion was 
one of many that spurred development of speech synthesis by artificial 
talking machines in following years (Cooper and Borst, 1952; Fant, 1973; 
Flanagan, 1983; Mattingly, 1968, 1974). The conclusion also raised the- 
oretical questions. For example: Why can we successfully transpose speech 
into a visual alphabet, using another sensory modality, if we cannot suc- 
cessfully transpose it within its "natural" modality of sound? Why is speech 
so much more effective than other acoustic signals? Is there some peculiar, 
perhaps biologically ordained, relation between speech and the structure of 
language? We will return to these questions below. 

I have not recounted these three failures of applied research missions to 
argue that money and effort spent on them were wasted. On the contrary, 
initial failure spurred researchers to revised efforts, and valuable progress 
has since been made. Reading machines for the blind, using an artificial 
speech output, have been develcwd and are already installed in large li- 
braries (Cooper et al. , 1984). Thert now exist automatic speech recognition 
devices that recognize vocabularies of roughly a thousand words, spoken 
in limited contexts by a few different speakers (Levinson and Liberman, 
1981). Scientific texts with well-defined vocabularies can now be roughly 
translated by machine, then rendered into acceptable English by an informed 
human editor. 

These advances have largely come about by virtue of brute computational 
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force and technological ingenuity, rather than through real gains in our 
understanding of language. This is not because we have made no gains, 
for as we shall see shortly, we surely have. However, none of the devices 
that speak, listen, or understand actually speaks, listens, or understands 
according to known principles of human speech and language. For example, 
a speech synthesizer is the functional equivalent of a human speaker to the 
extent that it produces intelligible speech. But it obviously does so by quite 
different means than those that humans use: none of its inorganic compo- 
nents correspond to the biophysical structures of larynx, tongue, velum, 
lips, and jaw. Instead, a synthesizer simulates speech by means of a complex 
system of tuned electronic circuits, and resembles a speaker somewhat as, 
say, a crane resembles a human lifting a weight. We are still deeply ignorant 
of the physiological controls by which a speaker precisely coordinates the 
actions of larynx, tongue, and lips to produce even a single syllable. 

In short, the main scientific value of the early work I have described was 
to reveal the astonishing complexity of speech and language, and the in- 
adequacy of earlier theories to account for it. One important effect of the 
initial failures was therefore to prepare the ground for a theoretical revolution 
in linguistics (and psychology) that began to take hold in the late 1950s. 

THE GENERATIVE REVOLUTION IN LINGUISTICS 

The publication in 1957 of Noam Chomsky's Syntactic Structures began 
a revolution in linguistics that has been sustained and developed by many 
subsequent works (e.g., Chomsky, 1965, 1972, 1975, 1980; Chomsky and 
Halle, 1968). To describe the course of this revolution is well beyond the 
scope of this chapter. However, the impact of Chomsky's writings on fields 
outside linguistics — philosophy, psychology, biology, for example — and 
their importance for the emerging science of language has been so great 
that some brief exposition of at least their nontechnical aspects is essential. 
I should emphasize that Chomsky's work has by no means gone unchal- 
lenged (e.g., Givon, 1979; Hockett, 1968; Katz, 1981). My intent in what 
follows is not to present a brief in its defense, but simply io sketch a bare 
outline of the most influential body of work in modern linguistics. 

The central goal of Chomsky's work has been to formalize, with math- 
ematical rigor and precision, the properties of a successful grammar. He 
defines a grammar as "a device of some sort for producing the sentences 
of the language under analysis" (Chomsky, 1957, p. 11). A grammar, in 
Chomsky's view, is not concerned either with the meaning of a sentence 
or with the physical structures (sounds, script, manual signs) that convey 
it. The grammar, or syntax, of a language is a purely formal system for 
arranging the words (or morphemes) of a sentence into a pattern that a 
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native speaker would judge to be grammatically correct or at least accept- 
able. In Syntactic Structures, Chomsky compared three types of grammar: 
finite-siate, phrase-structure, and transformational grammars. 

A finite-state grammar generates sentences in a left-to-right fashion: given 
the first word, each successive word is a function of the immediately 
preceding word. (Such a model is, of course, precisely that adopted by 
B.F. Skinner in his Verbal Behavior (1957), a dernier cri in behaviorism, 
published in the same year as the "premier cri" of the new linguistics.) 
Chomsky (1956) proved mathematically, as work on machine translation 
had suggested empirically, that a simple left-to-right grammar can never 
suffice as the grammar of a natural language. The reason, stated nontech- 
nically, is that there may exist dependencies between v/ords that are not 
adjacent, and an indefinite number of phrases containing other nonadjacent 
dependencies may bracket the original pair. Thus, in the sentence, Anyone 
who eats the fruit is damned, anyone and is damned are interdependent. 
We can, in principle, continue to add bracketing interdependencies indef- 
initely, as in Whoever believes that anyone who eats the fruit is damned is 
wrong, and Whoever denies that whoever believes that anyone who eats 
the fruit is damned is wrong is right. 

In practice, we seldom construct such sentences. However, the recursive 
principle that they illustrate is crucial to every language. The principle 
permits us to extend our communicative reach by embedding one sentence 
within another. For example, even a four-year-old child may combine, We 
picked an apple and / want an apple for supper into the utterance / want 
the apple we picked for supper. Thus, the child embeds an adjectival phrase, 
we picked ( = that we picked with the relative pronoun deleted), to capture 
two related sentences in a single utterance (cf. Limber, 1973). 

Chomsky goes on to consider how we might formulate an alternative and 
more powerful grammar, based on the traditional constituent analysis of 
sentences into "parts of speech." Constituent analysis takes advantage of 
the fact that the words of any language (or an equivalent set of words and 
affixes) can be grouped into categories (such as noun, pronoun, verb, 
adjective, adverb, preposition, conjunction, article) and that only certain 
sequences of these categories form acceptable phrases, clauses, and sen- 
tences. By grouping grammatical categories into permissible sequences, we 
can arrive at what Chomsky terms a phrase-structure grammar. Such a 
grammar is "a finite set . . . of initial strings and a finite set . . . of 
'instruction formulas' of the form X->Y interpreted: 'rewrite X as Y' " 
(Chomsky, 1957, p. 29). Figure 1 illustrates a standard parsing diagram of 
the utterance, The woman ate the apple, in a form familiar to us from 
grammar school (above), and as a set of "rewrite rules" from which the 
parsing diagram can be generated (below). 
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Article Noun Verb Noun Phrase 



the woman ate Article Noun 

i I 

the apple 



Rewrite Rules 

(1) Sentence — ► Noun Phrase + Verb Phrase 

(2) Noun Phrase — ► Article + Noun 

(3) Verb Phrase — ♦ Verb + Noun Phrase 

(4) Article — ► | the, a } 

(5) Noun — ► j woman, apple... } 
(6*/ Verb — ► j ate, seized... } 

FIGURE 1 Above, a parsing diagram dividing the sentence The woman ate the apple 
into its constituents. Below, a set of rewrite rules that will generate any sentence having 
the constituent structure shown above. 

Notice, incidentally, that rewrite rules are indifferent to meaning. They 
will generate anomalous utterances such as The chocolate loved the clock, 
no less readily than The woman ate the apple. Moreover, many lative 
speakers would be willing to accept such anomalous utterances as gram- 
matically correct, even though they have no meaning, i'his hints at the 
possibility that syntactic capacity might be autonomous, a relatively in- 
dependent component of the language faculty. This is a matter to which 
we will return below. 

An important point about a set of rewrite rules is that it specifies the 
grouping of words necessary to correct understanding of a sentence. The 
sentence Lets have some good bread and wine is ambiguous until we know 
whether the adjective good modifies only bread or both bread and wine. 
The distinction may seem trivial. But, in fact, the example shows that we 
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are sensitive (or can be made sensitive) to an ambiguity that could not have 
arisen from any difference in the words themselves or in their sequence. 
Rather, the origin of the ambiguity lies in our uncertainty as to how the 
words should be grouped, that is, as to their phrase structure. A correct (or 
incorrect) interpretation of their meaning therefore depends on the listener 
(and a fortiori the speaker) being able to assign an abstract phrase structure 
to the sequence of words. 

Whether a complete grammar of English, or any other natural language, 
could be written as a set of phrase-structure rules is not clear. In any event, 
Chomsky argues in Syntactic Structures that such a grammar would be 
unnecessarily repetitive and complex, since it does not capture a native 
speaker's intuition that certain classes of sentence are structurally related. 
For example, the active sentence Eve ate the apple and the passive sentence 
The apple was eaten by Eve could both be generated by an appropriate set 
of phrase-structure rules, but the rules would be different for active sentences 
than for their passive counterparts. Surely, the argument runs, it would be 
"simpler" if the grammar somehow acknowledged their structural relation 
by deriving both sentences from a common underlying "deep structure." 
The derivation would be accomplished by a series of steps or "transfor- 
mations" whose functions are to delete, modify, or change the order of 
the base constituents Eve, ate, apple. 

An important aspect of transformations is that they are structure depen- 
dent, that is, they depend on the analysis of a sentence into its structural 
components, or constitutents. For example, to transform such a declarative 
sentence as The man is in the garden into its associated interrogative Is the 
man in the garden! 9 a simple left-to-right rule would be: "Move the first 
occurrence of is to the front." However, the rule would not then serve for 
such a sentence as The man who is tall is in the garden, since it would 
yield Is the man who tall is in the garden! The rule must therefore be 
something like: "Find the first occurrence of is following the first noun 
phrase, and move it to the front" (Chomsky, 1975, pp. 30-31). Thus, a 
transformational grammar, no less than a phrase-structure grammar, pre- 
supposes analysis of an utterance into its grammatical (or phrasal) constit- 
uents. We may note, in passing, that children learning a language never 
produce sentences such as Is the man who tall is in the garden! Rather, 
their errors suggest that, even in their earliest attempts to frame a complex 
sentence, they draw on a capacity to recognize the structural components 
of an utterance. 

However, here we should be cautious. Chomsky has repeatedly empha- 
sized that". . .a generative grammar is not a model for a speaker or hearer" 
(1965, p. 9), not a model of psychological processes presumed to be going 
on as we speak and listen. The word "generative" is perhaps misleading 
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in this regard. Certainly, experimental psychologists during the 1960s de- 
voted much ingenuity and effort to testing the psychological reality of 
transformations (for reviews, see Cairns and Cairns, 1976; Fodor et al., 
1974; Foss and Hakes, 1978). But the net outcome of this work was to 
demonstrate the force of Chomsky 's distinction between formal descriptions 
of a language and the strategies that speakers and listeners deploy in com- 
municating with each other (cf. Bever, 1970). 

At first glance, the distinction might seem to be precisely that between 
langue and parole, drawn by de Saussure. However, for de Saussure, 
langur the system of language, "exists only by virtue of a sort of contract 
signed by the members of a community" (de Saussure, 1966, p. 14): it is 
a kind of formal artifice or convention, maintained by social processes of 
which individuals may be quite unaware. By contrast, for Chomsky the 
"generative grammar [of a language] attempts to specify what the speaker 
actually knows" (1965, p. 8). What a speaker knows, his competence in 
Chomsky's terminology, is attested to by "intuitive" judgments of gram- 
maticality. What a speaker does, performance (parole), is linguistic com- 
petence filtered through the indecisions, memory lapses, false starts, 
stammerings, and the "thousand natural [nonlinguistic] shocks that flesh 
is heir to." Thus, even though a theory of grammar is not a theory of 
psychological process, it is a theory of individual linguistic capacity. 

In Chomsky's view, the task of linguistics is to describe the structure of 
language much as an anatomist might describe the structure of the human 
hand. The complementary role of psychology in language research is to 
describe language function and its course of behavioral development in the 
individual, while physiology, neurology, and psychoneurology chart its 
underlying structures and mechanisms. 

Whether this sharp distinction between language as a formal object and 
language as a mode of biological function can, or should, be maintained 
is an open question. What is clear, however, is that it was from a rigorous 
analysis of the formal properties of syntax (and later of phonology: see 
Chomsky and Halle, 1968) that Chomsky was led to view language as an 
autonomous system, distinct from other cognitive systems of the human 
mind (cf. Fodor, 1982; Pylyshyn, 1980). His writings during the late 1950s 
and 1960s brought an exhilarating breath of fresh air to psychologists in- 
terested in language, because they offered an escape from the stifling be- 
havioristic impasse, already noted by Lashley (1951) and others (e.g., Miller 
etal., 1960). 

The result was an explosion of research in the psychology of language, 
with a strong emphasis on its biological underpinnings. Whatever one's 
view of generative grammai, it is fair to say that almost every area of 
language study over th-; past 25 years has been touched, directly or indi- 
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rcctly, whether into action or into reaction, by Chomsky's work. This will 
be obvious from the following selective review of research in four major 
areas: acoustic phonetics, American Sign Language (ASL), brain special- 
ization for language, and language development in children. 

Acoustic Phonetics 

We begin with audible speech, partly because we are then following the 
course of development, both in the species and the individual, from the 
bottom up; partly because it is in this area, where we are dealing with 
observable, physical processes, that the most dramatic progress has been 
made; and partly because we have come to realize in recent years that the 
physical medium of language places fundamental constraints on its surface 
structure. To understand this we must know something of the way speech 
is produced. 

The Source-filter Theory of Speech Production The source-filter theory, 
first proposed by Johannes Miiller in 1848, has been elaborated in the past 
50 years, notably at the University of Tokyo (Chiba and Kajiyama, 1941), 
the Royal Institute of Technology in Stockholm (Fant, 1960, 1973) and, 
in this country, the Massachusetts Institute of Technology (Stevens and 
House, 1955, 1961) and Bell Telephone Laboratories (Flanagan, 1983). As 
a result of this work, we are now able to specify accurately the possible 
acoustic outputs of any vocal tract, animal or human. 

When we speak, we drive air from our lungs through the pharynx, mouth, 
teeth, lips, and, sometimes, nose. The sound source is usually either the 
" voice" produced by rapid pulsing of the vocal cords (as in the final sounds 
of be and do), the hiss of air blown through a narrow constriction (as in 
the initial and final sounds of safe and thrush) or both (as in the final sounds 
of leave and bees). The resonant filter is the vocal tract, its air set into 
vibration by the flow of air from the lungs, much as we produce sound 
from a bottle or a wind instrument by blowing air across its top. 

To some large degree linguistic information (that is, consonants and 
vov els) is conveyed by systematic variations in the configuration of the 
vocal tract. For example, if we lower the tongue and move it back toward 
the pharynx, we set up a pattern of resc lances (known as formants) cor- 
responding to the vowel [a]. If we raise the tongue forward toward the 
gums, we set up resonances for the vowel [i]. Finally, if we raise the tongue 
backward toward the soft palate, we set up resonances for the vowel [u]. 
These three sounds are the most distinct vowels, both articulatorily and 
acoustically, that the human vocal tract can produce, and all known lan- 
guages use at least twu of them. 
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[We may note in passing that Lieberman and his colleagues (Lieberman 
andCrelin, 1971; Lieberman etal., 1972) have used the source-filter theory 
of speech production to demonstrate that these vowels lie outside the range 
of sounds that could be produced either by an adult chimpanzee or by a 
newborn human infant. The reason for this is that the larynx in both chim- 
panzee and infant, is high in the throat, restricting the range of possible 
tongue movements. An advantage of the high larynx for the infant is that 
it provides an arrangement of the oral tract such that, like other mammals, 
the infant can suck through its mouth and breathe through its nose at the 
same time. Over the first six months of life, the infant's larynx lowers, a 
special swallowing reflex develops to prevent food entering the lungs, and 
the infant becomes capable of producing the vowels of the language spoken 
around it. The lowered larynx seems to be one of several adaptations of 
the vocal apparatus that have suited it for speaking as well as for eating 
and breathing.] 

Of course, we do not speak only in vowels. Rather, we speak in runs of 
syllables, alternately constricting the vocal tract to form consonants, open- 
ing it to form vowels. (This repeated opening and closing of the tract 
produces the rises and falls of amplitude that are the basis of speech rhythm 
and poetic meter.) What is of interest, as we have already remarked, is that 
the tract configurations appropriate to particular consonants and vowels do 
not follow each other in linear sequence. At any instant, each articulator 
is executing a complex pattern of movement, of which the spatiotemporal 
coordinates reflect the influence of several neighboring segments. Readers 
may test this by slowly uttering, for example, the words cool and keel. 
They will find that the position of the tongue on the palate during closure 
for the initial consonant, [k], is slightly further back for the first word than 
for the second. The result of this interleaving is that, at any instant, the 
sound is conveying information about more than one phonetic segment, 
and that each phonetic segment draws information from more than one piece 
of sound — an obvious problem for automated speech recognition. Unfor- 
tunately, we cannot, as was at one time hoped, escape from this predicament 
by building a machine to recognize syllables, because similar interactions 
between phonetic segments occur across syllable boundaries. We see all 
this quite clearly if we examine a sound spectrogram. 

The Sound Spectrograph The sound spectrograph was developed at Bell 
Telephone Laboratories during World War II, to provide a visible display 
of the acoustic spectrum of speech as it changes over time. Originally, it 
was hoped that the device would enable deaf persons to use the telephone 
(Potter et al., 1947), but this proved impracticable because spectrograms 
are formidably difficult to read (but see Cole et al., 1980). 
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Figure 2 is a spectrogram of the utterance She began to read her book. 
Frequency on the ordinate is plotted against time on the abscissa. Variations 
in relative amplitude appear as variations in the darkness of the pattern. 
The dark bars correspond to formants, that is, to resonant peaks in the vocal 
tract resonance function. Scattered patches, as at the beginning, correspond 
to the noise of fricatives, e.g., [f], [s], and stop consonants, e.g., [p], [b]. 
A series of vertical lines has been drawn, dividing the spectrogram into 
discrete, acoustic segments. There are 25 of these segments, even though 
the utterance consists of only 17 phonetic segments and 7 syllables. Some 
of these acoustic segments correspond more or less directly to phonetic 
segments: thus, segments 1 and 2 correspond to the two sounds of she. 
Segment 3, on the other hand, corresponds to the first three sounds of 
began, segments 1 1 and 12 to the first sound of to, segment 23 to the first 
two sounds of book. 

The sound spectrograph revealed, for the first time, the astonishing var- 
iability of the speech signal both within and across speakers. It was also 
the basis for the first systematic studies of speech perception, from which 
we have learned which aspects of the signal carry crucial phonetic infor- 
mation. These studies, in turn, provided the basis for the development of 
speech synthesis. Thus, artificial talking machines, now being used in 
reading machines for the blind and in a variety of human-machine com- 
munication systems, rest squarely on the shoulders of the spectrograph. 

Speech Perception Early work in speech perception was largely guided 
by the demands of telephonic communication. Its aim was to estimate how 
much distortion (by filtering, noise, peak-clipping, and so on) could be 
imposed on the signal without seriously reducing its intelligibility (Licklider 
and Miller, 1951; Miller, 1951). Two general conclusions from this work 
were surprising and important. First, speech is so resistant to distortion that 
we can throw away large parts of the signal without reducing its intelli- 
gibility. Second, intelligibility does not depend on n. turalness. These two 
facts made it possible to learn a great deal about the important information- 
bearing elements in speech by stripping it down to its minimal cues. 

Work of this kind was first undertaken at Haskins Laboratories in New 
York during the 1950s, as part of a program to develop a suitable output 
for a reading machine. The key research tool was the Pattern Playback, 
developed by F.S. Cooper (Cooper, 1950; Cooper and Borst, 1952) to 
reconvert the visual pattern of a spectrogram into sound. The pattern, painted 
on a moving acetate belt, reflects frequency-modulated light to a photocell 
that drives a speaker. Figure 3 illustrates an early spectrogram and its 
stylized copy. If the copy is passed through the playback, it produces an 
intelligible version of the utterance to catch pink salmon. The utterance 
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FIGURE 2 A spectrogram of the utterance She began to read her boo).. Frequency is plotted on the ordinate, time on the abscissa; 
relative amplitude is represented by varying degrees of darkness in the display. The dark horizontal bands reflect resonant peaks in the 
vocal tract transfer function (formants, conventionally numbered from the bottom up: first formant, second formant, etc.); the vertical 
striations reflect repeated opening and closing of the glottis (voice). Heavy vertical lines have been drawn dividing the pattern into 25 
discrete acoustic segments (see text). m 
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FIGURE 3 Above, a spectrogram of the utterance To catch pink salmon. Below, a 
stylized copy of the spectrogram, sufficient to regenerate the utterance if played on the 
Pattern Playback. 
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sounds unnatural, partly because the formant bandwidths have been sharply 
reduced, partly because it is spoken in a monotone. 

The playback made it possible for experimenters to manipulate the speech 
signal systematically, by pruning, deleting, or exaggerating portions of the 
spectrograph^ pattern until they had determined the minimal cues for any 
particular utterance (Liberman, 1957; Liberman et al., 1959). With this 
device, and with its successors at Haskins and elsewhere, a body of knowl- 
edge was built up, sufficient for synthesis by rule of relatively high-quality 
speech (Fant, 1960, 1968; Flanagan, 1983; Mattingly, 1974). 

Several reviews of the perceptual implications of this work have been 
published (Darwin, 1976; Liberman et aL, 1967; Liberman and Studdert- 
Kennedy, 1978; Studdeit-Kennedy, 1974, 1976), and I will not review them 
here. However, two facts deserve note. First, the cues for a given phonetic 
segment (that is, for a particular consonant or vowel) vary markedly as a 
function of context. Figure 4 displays spectrograms of the naturally spoken 
syllables [did] and [dud]. We know from synthetic speech that a main cue to 
the initial [d] lies in changes in the second formant after onset. Notice that 
the second formant rises before [i], falls before [u], and that the rising and 
falling patterns are precisely reversed for the final [d]. Yet all are heard as 
[d]. Moreover, if these patterns or their synthetic versions are removed from 
context and presented to listeners for judgments, they are no longer heard as 
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[did] [dud] 
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FIGURE 4 Spectrograms ot naturally spoken [did] (deed) and [dud] (dood). The 
acoustic information specifying the alveolar place of articulation of the initial and final 
consonants is primarily carried by the second formant, centered around 2 kHz for [did] 
and slightly below 1 kHz for [dud]. Note that this formant forms a parabola, concave 
downwards in [did], concave upwards in [dud]. Despite this difference, both patterns 
arc heard as beginning and ending with [d]. 
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[d], nor are they heard as invariant. Rather they are heard as rising and falling 
tones (Liberman et al., 1967). In other words, different acoustic patterns are 
heard as different in a nonspeech context but as the same in a speech context. 
This is merely one of dozens of such examples. 

The second fact of note is that despite the apparent lack of discrete 
phonetic segments in the signal, listeners have little difficulty in learning 
to find segments — so little, in fact, that a segmental representation of speech 
is the basis of the alphabet. 

The interpretation of these facts is still a matter of controversy (e.g., 
Cole and Scott, 1974; Ladefoged, 1980; Stevens, 1975), and I will not 
pursue the matter here. However, it is worth noting that such findings gave 
rise to the hypothesis that humans have evolved a specialized perceptual 
mechanism for speech, d: tinct from, though dependent on, their general 
auditory system (Liberman, 1970, 1982; Liberman and Studdert-Kennedy, 
1978; Liberman et al., 1967). The hypothesis has received substantial sup- 
port from many dozens of studies of dichotic listening over the past 20 
years (e.g., Kimura, 1961, 1967; Shankweiler and Studdert-Kennedy, 1967; 
Studdert-Kennedy and Shankweiler, 1970; for a review, see Porter and 
Hughes, 1983). The conclusion from this work, and from studies of patients 
with separated cerebral hemispheres (see section below on brain speciali- 
zation for language) , is that the left hemisphere of most normal right-handed 
individuals is specialized not only for speaking (as has been known for 
many years from studies of brain-damaged patients), but also for perceiving 
speech. Specifically, there is now good reason to believe that "while the 
general auditory system common to both hemispheres is equipped to extract 
the auditory parameters of a speech signal, the dominant [i.e., left] hemi- 
sphere may be specialized for the extraction of linguistic features from these 
parameters" (Studdert-Kennedy and Shankweiler, 1970, p. 579). 

An important implication of this conclusion is that speech forms an 
integral part of the left-hemisphere language system discussed below. With 
this in mind let us turn to recent work on American Sign Language, which 
draws on a different perceptuomotor system from that of spoken language. 

AMERICAN SIGN LANGUAGE 

Speech is the natural medium of language. Specialized structures and 
functions have evolved for spoken communication: vocal tract morphology; 
lip, jaw, and tongue innervation; mechanisms of breath control (Lenneberg, 
1967); and perhaps even (as I have just suggested) matching perceptual 
mechanisms. But is there any further specialization for language? Is lan- 
guage an autonomous system, distinct from other cognitive systems, as 
Chomsky has argued? 
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compounding, by which signs may be combined to form a new sign different 
in meaning from its components. The process is analogous to that by which, 
in English, hard and hat, say, are combined to form hardhat, meaning a 
construction worker. Thus, the lexicon of ASL can be expanded by rule, 
not simply by iconic invention. 

Second, ASL has a.* elaborate system of inflections by which it modulates 
the meaning of a word. For example, in English, changes in aspectual 
meaning (that is, distinctions in the onset, duration, frequency, recurrence, 
permanence, or intensity of an event) are indicated by concatenating mor- 
phemes. We may say, he is quiet, he became quiet, he used to be quiet, 
he tends to be quiet 9 and so on. All these meanings are conveyed in ASL 
by distinct modulations of the root sign's movement. In the root sign for 
QUIET the hands move straight down from the mouth, while for TENDS 
TO BE QUIET they move down forming a circle. Similarly, related nouns 
and verbs are also distinguished by movements, while verbs are inflected 
by movement modulation for person, number, reciprocal action, and aspect. 

Third, ASL has a spatial (rather chan a temporal) syntax. Nouns intro- 
duced into a discourse are assigned arbitrary reference points in a horizontal 
plane in front of the signer. These points then serve to index grammatical 
relations among referents: verb signs are executed with a movement between 
two points, or across several points, to indicate subject and object. Thus, 
a grammatical function variously served in spoken language by word order, 
case markers, verb inflections, and pronouns is fulfilled in ASL by a spatial 
device. 

Finally, ASL has a variety of syntactic devices that make use of the face. 
Liddell (1978) has shown that a relative clause ("The apple that Eve offered 
tempted him") may be marked by tilting back the head, raising the eye- 
brows, and tensing the upper lip for the duration of the clause. Baker and 
Padden (1978) describe gestures of the face and head that mark the juncture 
of conditional clauses ("If you eat the fruit, you will be punished"). 

In short, though structural analysis of ASL is far from complete, it is 
evident that the language has a dual pattern of form and syntax, fully 
analogous to that of a spoken language. Nonetheless, there are differences. 
The main structural difference between ASL and English was illustrated 
by Klima and Bellugi (1979) in a comparison of their rates of communi- 
cation. The times taken to tell a story in the two languages were almost 
exactly equal. Yet the speaker used two to three times as many words as 
the signer used signs. The reason for the discrepancy, already hinted at, 
lies in the temporal distribution of information. Speech, for the most part, 
develops its patterns in time, sequentially, while ASL develops its patterns 
both simultaneously, in space, and sequentially. The difference is evidently 
due to the difference in the perceptual modalities addressed. Sign, addressed 
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to the eye, is free to package information in parallel; speech, addressed to 
the ear, is forced into a serial mode. What is interesting, of course, is that 
despite constraints of modality, the two languages convey information at 
roughly the same rate. This suggests that they may be operating under the 
same temporal constraints of cognition. 

What, finally, are the implications of this work for the study of speech 
and language? Evidently, the dual structure of language is not a mere 
consequence of perceptuomotor modality but a reflection of cognitive re- 
quirements. Whether these cognitive requirements are linguistic rather than 
general is still not clear. Differently put, we still do not know whether the 
relation between signed and spoken language is one of analogy or homology. 
If the two systems prove to be homologous, that is, if they prove to draw 
on the same neural structures and organization, we will have strong evidence 
that language is a distinct cognitive faculty. However, if they do not draw 
on the same underlying neural organization, we might suppose that linguistic 
structure is purely functional, the adventitious consequence of a cognitively 
complex animal's attempt to communicate its thought. Studies of sign- 
language breakdown due to brain injury, discussed below, are therefore of 
unusual interest and importance. 

BRAIN SPECIALIZATION FOR LANGUAGE 

Most of our knowledge of brain specialization for language comes from 
those "experiments of nature" in wnich soiae more or less circumscribed 
lesion (due to stroke, epilepsy, congenita! malformation, gunshot wounds, 
and so on) proves to be correlated with some more or l^ss circumscribed 
cognitive or linguistic deficit (for a brief account of modern brain-scannins 
techniques, see Benson, 1983, and references the r iin). Recently, our sources 
of knowledge have been expanded by use of brain stimulati n, preparatory 
to surgery under local anesthesia (Ojemann, 1983, and references therein), 
and by studies of so-called "split brain" patients whose cerebral hemi- 
spheres have been separated surgically for relief of epilepsy (see below). 
Some degree of concordance between patterns of brain localization in nor- 
mal and abnormal individuals has been established *>y experiments on nor- 
mals in which visual or auditory input is confirM, or more clearly delivered, 
to one hemisphere rather than the other (Moscovitch, J 983). 

Evidence From Studies of Aphasia 

The term aphasia refers to some impairment in language function, whether 
of comprehension, production, or both, due to some more or less well 
localized damage to the brain. Systematic study of aphasia goes back well 



236 



SOME DEVELOPMENTS IN RESEARCH ON LANGUAGE BEHAVIOR 



over a hundred years, and the Iite~*nre of the subject is vast (for reviews, 
see, for example, Goodglass and Geschwind, 1976; Hecaen and Albert, 
1978; Lesser, 1978; Luria, 1966, 1970). The most that can be done here 
is to hint at one area in which linguistics (that is, formal language descrip- 
tion) has begun to affect aphasia studies. 

Until recently, the standard framework for describing aphasic symptoms 
was that of the language modalities: speaking, listening, reading, and writ- 
ing, or, more generally, the dimensions of expression and reception. These 
are still the dimensions of the major test batteries used to diagnose aphasia, 
such as the Boston Diagnostic Aphasia Examination (Goodglass and Kap- 
lan, 1972). An important assumption, underlying any attempt at diagnosis, 
is that damage to a particular region of the brain has particular, not general, 
effects on language function. The assumption has strong empirical support 
and has led to the isolation of two (among several other) broad types of 
aphasia, nonfluent and fluent, respectively associated with damage to the 
left cerebral hemisphere in an anterior region around the third frontal con- 
volution (Broca's area) and a posterior region around the superior temporal 
convolution (Wernicke's area). 

Broca's area lies close to the motor strip of the cortex (in fact, close to 
that portion of the strip associated with motor control of the jaw, lips, and 
tongue), while Wernicke's area surrounds the prima y auditory region. In 
accord with this anatomical dissociation, a Broca's aphasic (that is, an 
individual with damage to Broca's area) has been classically found to be 
nonfluent: having good comprehension but awkward speech, characterized 
by pauses, difficulties in word-finding and distorted articulation; utterances 
are described as "telegrammatic," consisting of simple, declarative sen- 
tences, relying on nouns and uninflected verbs, omitting grammatical mor- 
phemes or function words. By contrast, a Wernicke's aphasic has been 
f ound to have poor comprehension, even of single words, but fluent speech, 
composed of inappropriate or nonexistent (though phonologically correct) 
words, often inappropriately inflected and/or out of order. 

Notice that these descriptions are still couched in terms of input and 
output — that is, modalities of behavior — rather than in linguistic terms. 
The idea that linguistic theory should be brought to bear on aphasia, and 
attempts made to charac rize deficits in terms of overarching linguistic 
function, has been proposed a number of times in the past (e.g., Jakobson, 
1941; Pick, 1913). But only recently (again, partly under the influence of 
Chomsky's view of language as an autonomous system, composed of au- 
tonomous syntactic and phonological subsystems) has the idea begun to 
receive widespread attention. The general hypothesis of the studies de- 
scribed below is that language breaks down along linguistic rather than 
modal lines of demarcation. 
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We will focus mainly on the hypothesis that syntactic competence is 
discretely and coherently representeu in Broca's area of the left frontal lobe. 
If this is so, the clinical impression that Broca's aphasics have good com- 
prehension, despite their agrammatic speech (and, incidentally, writing), 
must be in error. More carefu! testing should reveal deficits in their com- 
prehension, also. 

Caraiiazza and Zurif (1976) tested this hypothesis with three types of 
sentences: (1) simpie declarative sentences in which semantic constraints 
might permit decoding without appeal to syntax (The apple that the boy is 
eating is red)\ (2) so-called reversible sentences that require knowledge of 
syntactic relations for decoding (The boy that the girl is chasing is tall); 
and (3) implausible, though grammatically correct, sentences (The boy that 
the dog is patting is fat). The sentences were presented orally, and patients 
were asked to choose which of two pictures represented the meaning of the 
sentence. The incorrect alternative showed either a subject-object reversal 
or an action different from that specified by the vert). 

Broca's aphasics performed very well on simple declarative sentences 
and on sentences with strong semantic constraints (as when the incorrect 
alternative depicted the wrong action). On reversible plausible and im- 
plausible sentences (when the incorrect alternative depicted a subject-object 
reversal) the patients' performance was at chance. Caramazza and Zurif 
(1976) concluded that the clinical impression of good comprehension in 
Broca's aphasics was due to their ability to draw on semantic and pragmatic 
constraints to understand sentences despite their inability to process syntax. 

Other studies have shown that Broca's aphasics have difficulty in parsing 
a sentence into its grammatical constituents (Von Stockert, 1972); cannot 
use articles to assign appropriate reference in understanding a sentence 
(Goodenough et al., 1977); and cannot, in general, access closed-class 
grammatical morphemes (Zurif and Blumstein, 1978). These studies arc 
not without their critics (e.g., Linebarger et al., 1983), nor is the general 
claim that aphasic breakdown is typically (or, indeed, ever) along purely 
linguistic lines (Studdert-Kennedy, 1983, pp. 193-194): the locus and ex- 
tent of brain damage in aphasia is largely a matter of chance, and it is rare 
that language alone is affected. However, we have other sources of evidence 
to test the hypothesis that syntax is represented in the brain as a functionally 
discrete subsystem. 



One source of evidence is the split-brain patient whose cerebral hemi- 
spheres have bee separated surgically for relief of epilepsy. The condition 
permits an investigator to assess the cognitive and linguistic capacities of 
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each hemisphere separately. Zaidel (1978) has devised a contact lens, opaque 
on either the nasal or temporal side, that can be used (profiting from 
decussation of th« optic pathways) to ensure that visual information is freely 
scanned by a single hemisphere. A variety of written verbal materials — 
nonsense syllables, words, sentences of varying length and complexity — 
and pictures can then be used to test the capacities of the isolated hemi- 
spheres. For example, the sentences, The fish is eating or The fish are 
eating, can be presented to a single hemisphere, together with appropriate 
alternative pictures, to test the hemisphere's capacity to understand written 
verbal auxiliaries (is, are) (Zaidel, 1983). Similarly, pictures of various 
objects belonging to different classes (fruit, furniture, vehicles, etc.) might 
be presented to a single hemisphere to test the hemisphere's capacity to 
categorize. 

The number of available subjects is, of course, limited. But the conclu- 
sions from studies of four split-brain patients are remarkably consistent 
(Zaidel, 1978, 1980, 1983). In general, each hemisphere seems to have "a 
complete cognitive system with its own perception, memory, language, 
and cognitive abilities, but with a unique profile of competencies: good on 
some abilities, poor on others" (Zaidel, 1980,p. 318). Of particular interest 
in the present context is the finding that, although the right hemisphere 
cannot speak, it has a sizable auditory and reading lexicon. However, unlike 
the left hemisphere, the right cannot read new (nonsense o* unknown) words 
or recognize words for which it has no semantic interpretation. Similarly, 
the right hemisphere cannot group pictures of objects on the basis of rhyme 
(e.g., nail, male). Evidently, phonological analysis is the prerogative of 
the left hemisphere. 

The syntactic capacity of the right hemisphere is also limited. The hemi- 
sphere can recognize verbal auxiliaries (see above), but has difficulty in 
discriminating inflections (The fish car versus The fish eats). Similarly, the 
right hemisphere can recognize and interpret nouns, adjectives, and certain 
prepositions, but has difficulty with the English infinitive marker to. These 
findings on closed-class morphemes mesh to a degree with the deficits of 
Broca's aphasics, described above. Not surprisingly, the right hemisphere's 
capacity to understand sentences is sharply reduced: it cannot deal with 
sentences longer than about three words. 

On the evidence of these studies, then, the right hemisphere has essen- 
tially no phonological capacity and only a limited syntactic capacity. Un- 
fortunately, the limited syntactic capacity is equivocal because all these 
split-brain patients have had epilepsy since early childhood. Brain disorders 
are known to lead to reorganization and redistribution of function, partic- 
ularly in childhood (Lenneberg, 1967; Dennis, 1983). We cannot therefore 
be sure that such syntactic capacity as the right hemisphere displays does 
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not reflect compensation for left hemisphere deficiencies, induced by 
epilepsy. 



Studies of normally hearing, brain-damaged patients have established a 
double dissociation of brain locus and function in right-handed individuals: 
the left cerebral hemisphere is specialized for language, the right hemisphere 
for visual-spatial functions (as revealed, for example, by tests requiring a 
subject to copy a drawing, assemble wooden blocks into a pattern, or 
discriminate between photographs of unfamiliar faces). As we have seen, 
ASL is an autonomous linguistic system with a dual structure analogous to 
that of spoken language, on the one hand; yet, on the other, it encodes its 
meanings in visual-spatial rather than auditory-temporal patterns. How then 
should we expect brain damage to affect the language of a native ASL 
signer? 

The answer bears directly on our understanding of the basis of brain 
specialization for language. For if language loss in ASL aphasia follows 
damage to the right hemisphere, we may infer that language is drawn to 
the hemisphere controlling its perceptuomotor channel of communication. 
But if language loss follows damage to the left hemispheie, we may infer 
that the neural structure of that hemisphere is, in some sense, matched to 
the structure of language, whatever its modality. Language might then be 
seen as a distinct cognitive faculty, sufficiently abstract in its descriptive 
predicates to encompass both speaking and signing. 

Recent studies at the Salk Institute, the first systematic and linguistically 
motivated studies of ASL aphasia on record, support the second hypothesis. 
Moreover, the forms of ASL breakdown vary with locus of lesion in a 
fashion strikingly similar to certain forms of spoken-language breakdown. 
Bellugi, Poizner, and Klima (1983) describe three patients, all of whom 
are native ASL signers and display normal visual-spatial capacity for non- 
language functions. Their symptoms, resulting from strokes, divide readily 
into the two broad classes noted above for spoken language: two patients 
are fluent, one is nonfluent. 

The two fluent patients display quite different symptoms, coordinated 
with different areas of damage to the left hemisphere. The deficits of one 
patient (PD) are primarily grammatical; the deficits of the other (KL) are 
primarily lexical. PD has extensive subcortical damage from below Broca's 
area in the frontal lobe through the parietal to the temporal lobe, abutting 
Wernicke's area. PD produces basically normal root signs, but displays an 
abundance of semantic and grammatical paraphasias. He produces many 
semantically displaced signs (e.g., EARTH for ROOM, BED for CHAIR, 
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DAUGHTER for WIFE). More strikingly, he often modulates an appro- 
priate root form with an inappropriate or nonsensical inflection. Finally 
(despite his normal nonlanguage visual-spatial capacity), his spatial syntax 
is severely disordered: he misuses or avoids spatial indexing (the equivalent 
of pronominal function, as noted above), and overuses nouns. 

The second fluent patient, KL, has more limited damage, extending in 
a strip across the left parietal lobe. Her deficits, though relatively mild, are 
almost the reverse of PD's. First, she avoids nouns and overuses pronouns 
(spatial indexing). Second, she tends to make formational errors in root 
signs, producing nonsense items by substituting incorrect hand configura- 
tions, places of articulation, or movements. Thus, these two fluent patients 
display almost complementary deficits, breaking along linguistic fault lines, 
as it were, between lexicon and grammar. 

The third patient (GD) is nonfluent. She has massive damage over most 
of the left frontal lobe, including Broca's area. She produces individual 
signs correctly (with her nondominant hand, due to paralysis of the right 
side of her body), and can repeat a test series of signs rapidly and accurately, 
so that her deficits are not simply motoric. Yet her spontaneous signing 
invites description by just those epithets that characterize a Broca's aphasic. 
Her utterances are slow, effortful, short and agrammatic, largely made up 
of open-class items. She omits all grammatical formatives, including in- 
flections, morphological modulations, and most spatial indices. In short, 
this patient, too, displays a peculiarly linguistic rather than a general cog- 
nitive pattern of breakdown. 

From this brief review of brain specialization for language we may draw 
several conclusions. First, language breakdown seems to follow rough lin- 
guistic lines of demarcation, indicating that phonology (or patterns of sign 
formation) and syntax may be supported by separable neural subsystems 
within the left hemisphere. Second, left hemisphere specialization does not 
rest on a particular sensorimotor channel. Rather, the hemisphere supports 
general linguistic functions, common to both spoken and signed language. 
Thus, despite the left hemisphere's innate predisposition for speech (see 
section below on language acquisition), its initial neural organization is 
sufficiently plastic to admit quite different language forms (cf. Neville, 
1980; Neville et al. , 1982). At the same time, we still do not know enough 
about the anatomy and physiology of the brain to be sure that areas important 
for particular functions in spoken language precisely correspond to areas 
important for analogous functions in signed language: the issue of analogy 
versus homology is not yet closed. 

Several further cautions should be noted. It is not yet clear (either from 
linguistic theory or from behavioral evidence) that syntax and phonology 
constitute homogeneous functions: some aspects of syntax and phonology 
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fixed principles of ordering and applicability and containing a certain fixed 
substructure" (1972, p. 75). Second, the descriptive predicates of this sys- 
tem (grammatical categories, phonological classes) are not commensurate 
with those of any other known system in the world or in the mind. Third, 
the data available to the child in the speech of others is "meager and 
degenerate." Fourth, no known theory of learning— least of all a stimulus- 
response reinforcement theory of the kind scathingly criticized by Chomsky 
in his review (1959) of Skinner's Verbal Behavior (1957)— is adequate to 
account for a cliild's learning a language. Chomsky (1972) therefore assigns 
to the mind an innate property, a schema constituting the "universal gram- 
mar" to which every language must conform. The schema is highly re- 
strictive, so that the child's search for the grammar of the language it is 
learning will not be impossibly long. 

Chomsky (1972) then divides the research task into three parts. First is 
the linguist's task: to define the essential properties of human language, 
the schema or universal grammar. Second is the psychologist's task of 
determining the minimal conditions that will trigger the child's innate lin- 
guistic mechanisms. The third task, closely related to the second, arises 
from the assumption that most of the utterances a child hears are not well 
formed. How then is the child to know which utterances to accept as 
evidence of the grammar it is searching for and which utterances to reject? 
The third task is therefore to discover the nature of the relation between a 
set of data and a potential grammar, sufficient to validate the grammar as 
a theory of the language being learned. 

The proposition that language is an innate faculty of the human mind 
has a long history in Western thought from Plato to Darwin. The proposition 
is logically independent of any particular theory of language structure. 
Indeed, the entire enterprise of generative grammar might fail, yet leave 
the claim of innateness untouched. Certainly Chomsky's linguistic theories 
have been, and continue to be, a rich source of hypothesis and experiment 
in studies of language acquisition. However, his principle achievement in 
this area has been to force recognition that the learning of t language is an 
extraordinarily complex process with profound implications for the nature 
of mind. He has formulated the problem of language learning more precisely 
than ever before, spelling out its logical prerequisites in a fashion that 
promises to lead, given appropriate research, to a more precise specification 
of the innate "knowledge" that a child must bring to bear if it is ever to 
learn a language at all. 

As we have noted, Chomsky's challenge precipitated a vast quantity of 
research. The first need was for data, for systematic descriptions of how 
language actually develops. Work initially concentrated on syntactic de- 
velopment (e.g., Brown, 1973), but in the past dozen years has expanded 
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fixed principles of ordering and applicability and containing a certain fixed 
substructure" (1972, p. 75). Second, the descriptive predicates of this sys- 
tem (grammatical categories, phonological classes) are not commensurate 
with those of any other known system in the world or in the mind. Third, 
the data available to the child in the speech of others is "meager and 
degenerate." Fourth, no known theory of learning— least of all a stimulus- 
response reinforcement theory of the kind scathingly criticized by Chomsky 
in his review (1959) of Skinner's Verbal Behavior (1957)— is adequate to 
account for a cliild's learning a language. Chomsky (1972) therefore assigns 
to the mind an innate property, a schema constituting the "universal gram- 
mar" to which every language must conform. The schema is highly re- 
strictive, so that the child's search for the grammar of the language it is 
learning will not be impossibly long. 

Chomsky (1972) then divides the research task into three parts. First is 
the linguist's task: to define the essential properties of human language, 
the schema or universal grammar. Second is the psychologist's task of 
determining the minimal conditions that will trigger the child's innate lin- 
guistic mechanisms. The third task, closely related to the second, arises 
from the assumption that most of the utterances a child hears are not well 
formed. How then is the child to know which utterances to accept as 
evidence of the grammar it is searching for and which utterances to reject? 
The third task is therefore to discover the nature of the relation between a 
set of data and a potential grammar, sufficient to validate the grammar as 
a theory of the language being learned. 

The proposition that language is an innate faculty of the human mind 
has a long history in Western thought from Plato to Darwin. The proposition 
is logically independent of any particular theory of language structure. 
Indeed, the entire enterprise of generative grammar might fail, yet leave 
the claim of innateness untouched. Certainly Chomsky's linguistic theories 
have been, and continue to be, a rich source of hypothesis and experiment 
in studies of language acquisition. However, his principle achievement in 
this area has been to force recognition that the learning of t language is an 
extraordinarily complex process with profound implications for the nature 
of mind. He has formulated the problem of language learning more precisely 
than ever before, spelling out its logical prerequisites in a fashion that 
promises to lead, given appropriate research, to a more precise specification 
of the innate "knowledge" that a child must bring to bear if it is ever to 
learn a language at all. 

As we have noted, Chomsky's challenge precipitated a vast quantity of 
research. The first need was for data, for systematic descriptions of how 
language actually develops. Work initially concentrated on syntactic de- 
velopment (e.g., Brown, 1973), but in the past dozen years has expanded 
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to include phonology (e.g., Yeni-Komshian etal., 1980), semantics (e.g., 
Carey, 1982; MacNamara, 1982), and pragmatics (e.g., Bates and 
MacWhinney, 1982). As data have accumulated it has become possible to 
answer many questions and, of course, to ask many more. 

When does language development begin? Can we isolate reliable stages 
of development across children? Do the same stages occur in different 
language environments? Is the input to the child truly "meager and de- 
generate"? Is the child really constructing a grammar? Is the process pas- 
sive, or must the child actively engage itself? What is the role of imitation? 
Do we have to posit innate proclivities? If so, are they indeed purely 
linguistic? And so on. 

To see the force of these questions, we must have a sense of the com- 
plexity of the task that faces a child learning its native language. From our 
discussion of the problems of speech perception and automatic speech 
recognition, it will be obvious that we have much to learn about how the 
infant discovers invariant phonetic and lexical segments in the speech signal. 
We still do not know how the infant learns the basic sound pattern of a 
language during its first two years of life and comes to speak its first few 
dozen words. But let us set these puzzles aside and go straight to early 
syntax, where the bulk of child language research has been concentrated. 
The goal of this work has been to infer from a child's utterances (perfor- 
mance) what it "knows" {competence) about grammar and the meanings 
encoded by grammar, at each stage of its development. 

Consider, as an example, the sentence cited above, / want the apple we 
picked for supper, a sentence comfortably within the competence of a four- 
year-old child. What must a child know to produce such a sentence? We 
will look at three aspects of its structure to illustrate the basis of Chomsky's 
claim that grammatical categories do not map in any simple way onto the 
categories of general cognition. 

(1) Word order A child who utters the sentence evidently knows the 
standard subject-verb-object (SVO) order of English and so says, / want 
the apple. The child does not say as (transposing into English) a Turkish 
or Japanese child might say, / the apple want (SOV) or The apple I want 
(OSV). Presumably, the English-speaking child has long since learned that 
Adam loves Eve does not mean the same as Eve loves Adam. A Turkish or 
Japanese child, on the other hand, would have learned that uncertainties, 
due to variable word order, as to the underlying relations expressed in a 
sentence (who does what to whom) are resolved by attaching appropriate 
suffixes to subject and object (Slobin, 1982). 

So far, the mapping between grammar and world, in the threv languages, 
would seem to be arbitrary but direct. However, we are given pause by 



SOME DEVELOPMENTS IN RESEARCH ON LANGUAGE BEHAVIOR 



237 



another phrase in our example, the apple we picked ( = the apple that we 
picked). Here, in an object relative clause, the order of subject (we) and 
object (apple) is reversed, and the verb (picked) appears at the end, giving 
OSV. The switch from SVO (we picked that) to OSV (that we picked) is 
obligatory in English object relative clauses. Notice that, to apply this rule, 
a child cannot draw on any knowledge of the world; rather, it must (in 
some sense) know the grammatical structure of the sentence. We have here, 
then, another example of structure dependence, noted above in our dis- 
cussion of interrogatives. 

(2) Use of the article The child says, / want the apple, not / want an 
apple. Of course, if many apples had been picked, an apple would have 
been correct. The distinction between definite and indefinite articles seems 
natural to an English speaker. To a speaker of Russian, Chinese, or other 
languages in which articles are not used, the distinction might seem tiresome 
and unnecessary. In fact, rules for use of articles in English are complex 
and, with respect to the aspects of the world that they encode, seemingly 
arbitrary. Yet the rules are learned by the third or fourth year of life (Brown, 
1973, p. 271). 

(3) Noun phrases As a final example, consider the noun phrase the 
apple we picked. These four words (article + noun + adjectival phrase) 
form the grammatical object of the sentence. A child who utters them must 
already know the general rule for constructing noun phrases in English: the 
adjective goes before the noun (the red apple), not, as in French, after the 
noun (la pomme rouge). However, there is an exception to the rule: if the 
adjective is itself a phrase (that is, a relative clause: that we picked), the 
adjective must follow the noun (the apple we picked, not the we picked 
apple). Once again, the child reveals in its utterance knowledge of a rule 
of English grammar that cannot be derived from knowledge of the wcild. 

In short, there are solid grounds for believing that language structure 
(both at the level of sound pattern, or phonology, and at the level of syntax) 
may be sui generis. With this in mind let us briefly review some of what 
we know about the course of development, with particular attention to the 
questions with which we began. 

The infant is biologically prepared to distinguish speech from nonspeech 
at, or very soon after, birth. A double dissociation of the left cerebral 
hemisphere for perceiving speech and of the right hemisphere for perceiving 
nonspeech sounds within days of birth has been demonstrated both elec- 
trophysiologically (e.g., Molfese, 1977) and behaviorally (e.g., Segalowitz 
and Chapman, 1980). Further, dozens of experiments in the past 10 years 
have shown that infants, in their first six months of life, can discriminate 
virtually any adult speech contrast from any language on which they are 
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tested (e.g., [b] versus [pj, [d] versus [g], [m] versus [n]) (Aslin et aL, 
1983; Eimas, 1982). There is also evidence that infants begin to recognize 
the function of such contrasts, to distinguish words in the surrounding 
language, during the second half of their first year (Werker, 1982). (For 
fuller review, see Studdert-Kennedy, 1985.) 

In terms of sound production, Oiler (1980) has described a regular pro- 
gression from simple phonation (0-1 months) through canonical babbling 
(7-10 months) to so-called variegated babbling (1 1-12 months). The pho- 
netic inventory of babbled sounds is strikingly similar across many lan- 
guages and even across hearing and deaf infants up to the end of the first 
year (Locke, 1983). These similarities argue for a universal, rather than 
language-specific, course of articulatory development. 

However, around the end of the twelfth month, when the child produces 
its first words, the influence of the surrounding language becomes evident. 
From this point on, universals become increasingly difficult to discern, 
because whatever universals there may be are masked by surface diversity 
among languages. In this respect, the development of language differs from 
the development of, say, sensorimotor intelligence or mathematical ability 
(cf. Gelman and Brown, this volume). Nonetheless, we can already trace 
some regularities acros" chilf* n within a language and, to some lesser 
extent, across languages. 

The most heavily studied stage of early syntactic development, in both 
English and some half-dozen other languages, is the so-called two- 
morpheme stage. Brown (1973) divides early development into fl stages 
on the basis of mean length of utterance (MLU), measured in terms of the 
number of morphemes in an utterance. The stages are "not . . . true stages 
in Piaget's sense" (Brown, 1973, p. 58), but convenient, roughly equi- 
distant points from MLU = 2.00 through MLU = 4.00. The measure 
provides an index of language development independent of a child's chro- 
nological age. 

Of interest in the present context is that no purely grammatical description 
of Stage I (MLU = 2.00, with an upper bound of 5.00) has been found 
satisfactory. Instead, the data are best described by a "rich interpretation," 
assigning a meaning or function to an utterance on the basis of the context 
in which it occurs. Brown lists eleven meanings for Stage I constructions, 
including: naming, recurrence (more cup), nonexistence (all gone egg), 
agent and action (Mommy go), agent and object (Daddy key), action and 
location (sit chair), entity and location (Baby table), possessor and pos- 
session (Daddy chair), entity and attribute (yellow block). Brown (1973) 
proposes that these meanings "derive from sensorimotor intelligence, in 
Piaget's sense . . . [and] probably are universal in humankind but not . . . 
innate" (p. 201). 
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We should emphasize that these Stage I patterns reflect semantic, not 
grammatical, relations even though they may be necessary precursors to 
the grammatical relations that develop during Stage II (MLU = 2.50, with 
an upper bound of 7.00). Brown (1973) traced the emergence of 14 gram- 
matical morphemes in three Stage II English-speaking children. The mor- 
phemes included: prepositions (in, on), present progressive (/ am playing), 
past regular (jumped), past irregular (broke), plural -s, possessive -s, third 
persons -s (he jumps), and others. The remarkable finding was that all three 
children acquired the morphemes in roughly the same order (with rank order 
correlations between pairs of children of 0.86 or more). This result was 
confirmed in a study of 21 English-speaking children by de Villiers and 
deVilliere (1973). 

However, unlike the meanings and functions of Stage I, the more or less 
invariant order of morpheme acquisition of Stage II has not been confirmed 
for languages other than English. Perhaps we should not expect that it will 
be. Languages differ, as we have seen, in the grammatical devices that 
they use to mark relations within a sentence. The devices used by one 
language to express a particular grammatical relation may be, in some 
uncertain sense, "easier" to learn than the devices used by another language 
for the same grammatical relation. Slobin (1982) has compared the <iges at 
which four equivalent grammatical constructions are learned ir >tkish, 
Italian, Serbo-Croatian, and English. In each case, the Turkish children 
developed more rapidly than the other children. If these results are valid 
and not mere sampling error, the "studies suggest that Turkish is close to 
an ideal language for early acquisition" (Slobin, 1982. p. 145). 

Unless we suppose that Turkish parents are more attentive to their chil- 
dren's language than Italian, Serbo-Croatian, and English parents, we may 
take this result as further evidence that "selection pressures" (reinforce- 
ment) have little roh to play in language learning. Brown and Hanlon 
(1970) showed some years ago that parents tend to correct the pronunciation 
and truth value, rather tha , the syntax, of their cnildren's speech. Indeed, 
one of the puzzles of language development is why children improve at all. 
At each stage, the child's speech seems sufficient to satisfy us needs. Neither 
reinforcement nor imitation ot adult speech suffices to explain the improve- 
ment. Early speech is replete with forms that the child has presumably 
never heard: two sheeps, we goed, mine boot. These errors reflect not 
imitation, but over-generalization of rules for forming plurals, past tenses, 
and possessive adjectives. 

We come then to a guiding assumption of much current research: Learning 
a first language entails active search for language-specific grammatical 
patterns (or rules) to express universal cognitive functions. The child may 
be helped in this by the relative "transparency" (Slobin, 1980) of the speech 
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addressed to it — either because the language itself, like Turkish, is trans- 
parent and/or because adult speech to the child is conspicuously well formed. 
Several studies (e.g., Newport et al., 1977) have shown that the speech 
addressed to children tends not to be "degenerate." Yet the speech may 
be "meager" in the sense that relatively few instances suffice to trigger 
recognition of a pattern (Roeper, 1982). Such rapid learning would seem 
to require a system specialized for discovering distinctive patterns of sound 
anu syntax in any language to which a child is exposed. 

Finally, it is worth remarking that all normal children do learn a language, 
just as they learn to walk. Western societies acknowledge this in their 
attitude to children who fail: we regard them as handicapped or defective, 
and we arrange clinics and therapeutic settings to help them. As Dale (1976) 
has remarked, we do not do the same for children who cannot learn to play 
the piano, do long division, or ride a bicycle. Of course, children vary in 
intelligence, but not until I . Q. drops below about 50 do language difficulties 
begin to appear (Lenneberg, 1967). Children at a given level of maturation 
also vary in how much they talk, what they talk about, and how many 
words they know. Where they vary little, it seems, is in their grasp of the 
basic principles of the language system — its sound structure and syntax. 

CONCLUSION 

The past 50 years have seen a vast increase in our knowledge of the 
biological foundations of language. Rather than attempt even a sampling 
of the issues raised by the research we have reviewed, let me end by 
emphasizing a point with which I began: the interplay between basic and 
applied research, and between research and theory. 

The advances have come about partly through technological innovations, 
permitting, for example, physical analysis of the acoustic structure of speech 
and precise localization of brain abnormalities; partly through methodolog- 
ical gains in the experimental analysis of behavior; partly through growing 
social concern with the blind, the deaf, and otherwise language-handi- 
capped. Yet these scattered elements would still be scattered had they not 
been brought together by a theoretical shift from description to explanation. 

Perhaps the most striking aspect of the development is its unpredictability. 
Fifty years ago no one would have predicted that formal study of syntax 
would offer a theoretical framework for basic research in language acqui- 
sition, now a thriving area of modern experimental psychology, with im- 
portant implications for treatment of the language-handicapped. No one 
would have predicted that applied research on reading machines for the 
blind would contribute to basic research in human phonetic capacity, lending 
experimental support to the formal linguistic claim of the independence of 
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phonology and syntax. Nor, finally, would anyone have predicted that basic 
psycholinguists research in American Sign Language would provide a 
unique approach to the understanding of brain organization for language 
and to testing the hypothesis, derived from linguistic theory, that language 
is a distinct faculty of the human mind. 

Presumably, continued research in the areas we have reviewed and in 
related areas that we have not (such as the acquisition of reading, the motor 
control and coordination of articulatory action, second language learning), 
will consolidate our view of language as an autonomous system of nested 
subsystems (phonology, syntax). Beyond this lies the further task of un- 
folding the language system, tracing its evolutionary and ontogenetic origins 
in the nonlinguistic systems that surround it and from which, in the last 
analysis, it must derive. We would be rash to speculate on the diverse areas 
of research and theory that will contribute to this development. 

* * * 

I thank Ignatius Mattingly for comments and advice. 
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INTRODUCTION 

Experimental psychology started with the study of how we perceive 
pictures and of the conditions under which one object is an effective sur- 
rogate for another (that is, the two objects elicit the same effect). Such 
study has served the purposes of other disciplines as well, and remains 
inherently interdisciplinary. 

Prior to 1850 the problem was primarily pursued by artists and philos- 
ophers, and the conceptual tools were essentially those of physics and 
geometry. In the classical period, roughly from 1850 to 1950, the primary 
theoretical concerns were those of neurophysioiogists and psychologists. 
Major applications — in visual prosthesis (e.g., optometry and ophthal- 
mology), the visual media (e.g., photography, print, and eventually tele- 
vision), and the interface between human and machine currently called 
human factors — motivated much of the research that provided a rich base 
of technical data. 

The present period of tremendous ferment started around 1950. The 
problems of perception continue to engage all the disciplines aheady men- 
tioned; in addition, computer science is now a major presence in the field, 
providing tools and motivation in several distinct but closely related ways: 
as a source of techniques for research, theory testing, and modeling; as a 
source of analogies and metaphors; as an overlapping enterprise, seeking 
to devise machines that will "perceive" in the same way that people do; 
and in the context of learning how to generate and display computer images 
that humans can readily and accurately comprehend. 
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THE PRE-1850s: ARTISTS, PHILOSOPHERS, AND PHYSICISTS 

Artists have known for centuries that one way to produce a picture is to 
make a surrogate object that (ideally) offers the eye the same pattern of 
light as that offered by the scene itself. The most famous example of this 
is Leonardo's window (Figure 1A): By tracing the outlines of objects on a 
plane of glass interposed between his eye and the scene, the artist discovers 
the characteristics of a two-dimensional projection of a three-dimensional 
scene. Of course, the method could be used to provide pictures of existing 




(A) 




FIGURE 1 Surrogates and their preparation. A: One of the optical aids that artists 
have used for centuries (Durer) to help in preparing a surrogate that provides the eye 
with much of the same stimulus information as the object or scene being represented. 
B: By studying the tracings made of scenes viewed through a glass pane Leonardo 
advised that artists could learn the characteristic two-dimensional projections of three- 
dimensional layouts and could then construct pictures of imagined scer*s. 
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scenes with no need for the artist to learn anything: the scene could be 
traced directly on the glass (Figure IB) or— with the growth of technology— 
by photographic or video media. 

Some traditional features that result from projecting normal three-di- 
mensional scenes on twc dimensions appear in Figure 2: these include linear 
perspective, familiar size, relative size, and interposition. Note that even 
a perfect picture produced in this way is inherently ambiguous, in that both 
the flat surrogate and the very different three-dimensional layout it repre- 
sents offer the same light to the eye. This is the aspect of pictures that 
made them, and visual perception, of interest to the philosophers — the 
epistemological issue of how we can know what is true. Philosophical 
concerns aside, the ambiguity is inherent as a matter of simple mathematics, 
and provides both the opportunity for pictorial communication and a tool 
for psychological and physiological inquiry. 

The artist who learns to use signs of depth, as in Figure 2, can produce 
surrogates of scenes that do not and perhaps could not exist — virtual scenes 
of grottos, unicorns, and biblical and extraterrestrial events. Indeed, we 
shall see that in the interest of visual comprehensibility it is necessary to 
depart from pure projection, and most pictures are therefore to some extent 
surrogates of virtual rather than actual scenes. 

Today computers provide an increasing proportion of the still and moving 
pictures that humans confront. For them to do so programmers must learn 
how to project three-dimensional layouts in two-dimensional arrays and to 
generate the play of light and shade by which different surface textures are 

FIGURE 2 The major Tracings on th« picture plane 

pictorial (monocular) 
depth cues: the tracing of 
the scene in Figure IB. 
Linear Perspective: paral- 
lel lines 6-8, 7-9, etc., 
converge in the picture 
plane, interposition: the 
nearer object 4 occludes 
part of the farther object 
5. Relative Size: the trac- 
ing of boy 1 is larger 
than that of boy 2. Tex- 
ture-Density Gradient: 
the evenly spaced bars on 
the field 6-7-8-9 project an image whose density increases with distance. Familiar 
Size: if man 3 is known to be larger than boy 1, and they are the same size in the 
picture plane, then the man must be proportionally farther away in the represented scene. 
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perceived (Figure 3). The study of such rules— traditionally called depth 
cues (Woodworth, 1938) and lately called "ecological optics" (Gibson, 
1979)— is fundamentally a branch of physics, but one that must be pursued 
with the psychological and neurophysiological limitations and contributions 
of the human viewer firmly in mind. 

Surrogates are therefore more than means of pictorial communication: 
they tell us about the limits of the information that the sense organ can pick 
up and about how the brain organizes that information. Perhaps the earliest 
major instance of that point was in Newton's (1672) famous experiment in 
visual sensation, showing that an appropriate mix of three narrow wave 
lengths of light — bands of color taken out of the spectrum, such as red, 
green, and blue — can serve as a surrogate for any and all colors in the 
spectrum, and thys match any scene (Figure 4). This is not a fact about 
photic energy — the light itself remains unchanged by the mixture. It is 
instead a strong clue about our sensory nervous systems, and it provided 
the background for the classical theory of perception and the nervous system, 
which we consider next. 

PSYCHOLOGY AND PHYSIOLOGY FROM 1850-1950 

Given the facts of color mixture, the most parsimonious model of visual 
perception was the Young-Helmholtz theory (Helmholtz, 1866): that color 
perception is mediated by three kinds of specialized receptor neurons, the 
cones, each responsive to most of the spectrum, but each with a different 
sensitivity function. The three types were thought to be most sensitive to 
light that looks red, green, and blue, respectively, and their response to 




FIGURE 3 Computer-drawn image. A picture programmed directly from blueprints 
of a building, using a polygon facet approach with a simple lighting model that simulates 
direct sun and diffuse sky illumination. Paul Roberts, Computer Vision Lab, Columbia 
University. 
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FIGURE 4 Color surrogates. To the visual system G, a - 'ible mixture F of just 
three wavelengths selected by slits E from the visible spectrum D can, in principle, be 
a surrogate for any hue C, that is, any set of wavelengths in the spectrum. According 
to the traditional Young-Helmholtz theory, the physiological explanation involves three 
types of retinal cone cells with the ihree sensitivity functions shown in H. From these 
we can see, for example, that a mix of equally effective intensities at 650 and 530 is 
indistinguishable from 580 and could serve as a surrogate for the latter. 



photic energy was thought to underlie the experience of those colors. The 
retina was envisioned as a mosaic of independent triads of the three cones 
(Figure 5), and the light provided to the eye by any scene was thought to 
be analyzed into the point-responses of the three component colors. The 
research most directly relevant to this theory was the attempt to map the 
sensitivity of earh type of cone to the wavelengths of the visible spectrum 
and to map the spatial resolution of the retin? mosaic — what detail the eye 
could be expected to resolve. 

Such information as the limits of resolution and the bases and specification 
of colors provider the first goals for what has become visual science and 
its applications, which now run from the prescription of spectacles to the 
design of television characteristics. It was also the foundation of the classical 
view of Ihe perceptual process in general, diagrammed in Figure 6: at left, 
the ooject in the world, with its physical properties of distance, size, shape, 
reflectance (surface color). These do not affect the sense organs di ^ctly, 
of course, but only by means of the light they reflect to the sensitive :ells. 
All things that cause the cells to respond in some specific way elicit the 
same sensory experience: the light coming from the object itself, the light 
produced by some surrogate of th t object, the effects of mechanical or 
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FIGURE 5 The sensory mosaic. 
In the simplest view, the retina of 
the eye contains a mosaic of light- 
sensitive cells. The spacing of the 
mosaic determines what detail can 
be seen: e.g., to distinguish a "C" 
from an 44 0, M at least one cell (x) 
must go unstimulated. The visible 
portion of the eleruomagnetic ra- 
diation incident at each point in the 
retina that is capable of full color 
vision is coded into the output of 
each of three cones according to its 
sensitivity curve (Figure 4H). This is, of course, essentially 
the way in which video cameras analyze the light they receive from scenes. 
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FIGURE 6 The classical theory (1850-1950). The distal stimulus, D.S. (an object or 
layout of objects), with such physical properties as size, reflectance, position in space, 
etc., impinges on the sensory surface by way of the proximal stimulus pattern, P.S., 
consisting of regions that vary in their spatial extent (6) and spectral distribution [lu- 
minance (L), wavelengths (X)]. Sensory responses to eac.i region (sensations) were 
thought to vary correspondingly in brightness (L') and hue (the mix of Red, Green, 
and Blue) over some extent (6). Because of the regularitir: of the world and its geometry, 
the proximal stimulation will generally contain patterns (e.g., the cues in Figure 2) that 
are characteristic of ai therefore provide information about the distal properties. The 
perception of such properties (objects' sizes, surface reflectances, spatial location, etc.) 
were thought to derive from the underlying sensations by associative learning and by 
computational processes. 
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electrical stimulation of the eye, etc. Insofar as different objects and everts 
produce the same responses, information about the world is lost in this 
encoding process. This is what makes surrogates possible. And the fact 
that very different objects, and indeed different patterns of light, have the 
same effects on the nervous system provides a tool with which to study 
that system's structure and function. 

The visual system thus conceived is a mosaic of receptors (the retina) 
on which the eye's optical system projects a focused image of the light 
provided by the object The receptors (the three types of cone, supplemented 
by rods, which do not differentiate color) analyze each small region of that 
image into points of red, green, and blue. This conception of the visual 
system has now been embodied in the television camera: Television, like 
the Helmholtzian visual system, reduces the countless objects and events 
of the world to the different combinations of a set of three colors in a spatial 
mosaic. It is important both for the Helmholtzian theory and for television 
as a medium that such a simple set will suffice. In both cases, all of the 
remaining properties of the objects that we perceive in the world — their 
sizes, forms, and reflectances (i.e., surface colors), their distances and 
movements — are lost in the encoding process and must be supplied by the 
viewer. 

The simplest theory about such nonsensory processes was inherited from 
centuries of philosophical analyses of perception: the theory that we have 
learned the perceptual properties of objects from our experiences with the 
world. It runs as follows: 

The sense organs analyze the world into fundamental sensations. 

Those sensations are, in the case of vision, the sets of points that differ 
in hue (R, G, B in Figure 6, signifying red, green, and blue sensory 
experiences) and brightness (L') over some effective extent, 8'. These 
packets of sensations normally come in characteristic patterns that are im- 
posed by the regularities of the physical w^rld, patterns such as the depth 
cues in Figure 2. By learning these regularities and their meanings, we 
learn to perceive the physical world and its properties. 

The theory seems to be economical and elegant. The principles of learning 
appeared to be at hand. For almost two centuries (from Hobbes in 1651 to 
James Mill in 1829), the British empiricist philosophers had discussed how 
the "laws of association," offered in essence by Aristotle, could serve to 
build our perceptions and ideas about the objects and events of the world. 
And a plausible neurophysiological explanation of association readily of- 
fered itself in terms of increased readiness of nerve cells that had been 
repeatedly stimulated simultaneously to fire together. 

This outline of how we perceive objects and their pictures fitted nicely 
into a general theory of knowledge and of science, spanning from neuro- 
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physiology to sociology and political science. With respect to the last, for 
example, the view that all our ideas about the world derive from our 
experiences with it leads readily (but not ineluctably) to the belief that 
human intelligence and character are generally perfectible through educa- 
tion, and to the advocacy of egalitarianism and individualism over a wide 
range of social and political issues. 

Although formulated by Helmholtz only one academic generation after 
his teacher, Johannes Mueller (1838), first undertook the scientific analysis 
of experience, what I am calling the classical theory of perception thus had 
wide and deep connections with the mainstream of Western tnought, and 
it remained the dominant theory in neurophysiology and psychology until 
the 1950s. 

It was not without opposition, however. Some opposition was based on 
a cluster of purely psychological flaws. Although, for example, the theory 
tells us which different stimuli will act as mutual surrogates — that is, which 
different objects will produce the same perceptual experience — it does not 
tell us what that experience will be like. It does not predict the attributes 
of the experience itself, i.e., it tells us that light composed of a mixture of 
650 nanometers (red) and 540 nanometers (green) is indistinguishable in 
appearance from light of 580 nanometers (nm) (yellow), but it gives us no 
basis for predicting how that appearance is similar to and different from 
other colors. As we will see, alternative theories, almost as old as the 
Helmholtzian one, offer much more in the way of accounting for appear- 
ance. Notable among these proposals based on phenomenology (the study 
of appearances as such) were the following: Hering (1878) argued that 
perceived colors comprise red-green, yellow-blue, and black-white oppo- 
nent systems; that connections between cells of the two retinas provide for 
an innate sense of depth; and that lateral inhibition between adjacent regions 
of the visual system make their appearances mutually dependent. Mach 
(1886) proposed (among other things) that such lateral connections provide 
networks that are sensitive to contours and not merely to incident energy. 

A related problem is illustr?* id in figure 7. In most situations in the real 
world, the local stimulation that is projected to the eye is not by itself 
information about object properties. Even if the two gray target disks on 
the cube are of identical lightness or reflectance (RJ, the luminance or 
photic energy each provides the eye is different (L b l^) because the illu- 
mination falling on each is different (E b E2). Again, even if the two vertical 
rods on the rif at are of the same physical size (S), the size of the retinal 
image each provides (0 x , 0 2 ) differs because the rods lie at different distances 
(D| , D 2 ). Nevertheless, we tend tc perceive such object properties correctly, 
desphe changing retinal stimulation. The classical theory held that this 
object constancy, as it is now known, is achieved when the viewer takes 
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R t = Lj/Ej = L 2 /E 2 S = DxTan e 

FIGURE 7 Object constancy. Although both target disks on the cube have the same 
reflectance (RJ, the luminances (L,, L£ differ to the eye because the illuminations (E,, 
E2) differ. Similarly, objects of the same size (S) provide images of different extent 
(0|, 62) depending on their distances (D,, D2). We tend to see objects 1 relatively 
permanent qualities, such as their reflectance and size, as constant even though the 
proximal stimulation they provide is in flux. In the classical theory we do this by 
learning to 4 ,ikCSS visual information according to the formulae R(refiectance) = 
L(luminance)/E(illumination), and S(size) = D(distance) x tangent of 6(visual extent). 

the conditions of seeing into account: in effect, by using the depth cues to 
perceive depths Dj and D 2 , and then using the latter to infer the object sizes 
from the retinal sizes (8j, 82); similarly, to use cues to perceive the illu- 
minations Ei and E2, and, using the latter, to infer the reflectances of the 
parts of the scene from their luminances. 

This explanation is now commonly called "unconscious inference/ ' Its 
opsratioj assumes that the viewer has learne * he constraints in the physical 
world (e.g. , that L= R x E, that S = kDtan0, etc.). These constraints, once 
learned, provide a mental structure that mirrors the physical relationship 
between the attributes of the object and those of sensory stimulation, per- 
mitting the viewer to infer or compute the former from the latter. A general 
form of this explanation is that we perceive just that state of affairs in the 
world that would, under normal conditions, be m^st likely to produce the 
pattern of sensory responses we receive. 

The learning processes that might underlie such computations have never 
been formally and explicitly worked out. What we would now call "lookup 
tables' ' (for example, with grouped entries for S, 0, and D) would be 
compatible with theories about associative learning. Helmholtz and others 
often wrote, however, as though we iearn to apply the rules that mirror 
those of the physical world; they did noi «ay explicitly, however, how such 
abstract principles, as distinguished from lookup tables listing the elements 
of sense data, are learned. 
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The Helmholtzian idea that our perceptions of objects rest on compu- 
tational or inferential process was, like the classical theory's failure to 
predict appearances, roundly criticized over the years as being uneconom- 
ical, mentalistic, and unparsimonious. Gestalt theory, which had a signif- 
icant impact in psychology and art theory between the two world wars, was 
particularly vocal in this regard. But the criticisms of the classical theory 
did not amount to much until the end of World War n. Then the needs of 
new technology (flight training, radar and sonar displays, etc.), the devel- 
opment of new instrumentation (notably, direct amplifiers that made the 
measurement of very small bioelectrical tissue responses common and re- 
liable), and the effects of grants that made the research career a viable 
occupation, all combined to turn the tables. As we will see, Helmholtz was 
right about the three cones and in some sense about the existence of mental 
structure and computation. But most of the rest of what lay between those 
points was wrong, and most of the alternative proposals that had been made 
by the critics of that dominant approach, especially those of Hering and 
Mach, were quite remarkably vindicated within a period of a very few 
years, after having been largely ignored for many decades. 

THE 1950s AND AFTER: "DIRECT* SENSITIVITY 
TO OBJECT ATTRIBUTES 

The two main arguments on which the classical theory rested were, first, 
that it was the simplest answer to the problem of analyzing the world of 
sensory stimulation, and second, that it was in accord with neurophysio- 
logical observation. In the 1950s both of these supports were withdrawn. 

Technically, as is widely recognized, the most important single advance 
in instrumentation was the microelectrode, which made it possible to record 
the activity of individual nerve cells in the visual system and brain of an 
essentially intact animal that is exposed to various sensory displays. It 
quickly became evident that most of the cells observed in this way respond 
not to individual points of local stimulus energy but to extended spatial and 
temporal patterns — to adjacent differences in intensity, specific features, 
a td movements in one rather than another direction (Figure 8). They appear 
to do so by means of networks of lateral connections, which were very 
much what Hering and Mach had argued. 

In the 1950s Hurvich and Jameson (1957) offered sensitivity curves for 
the red-green and yellow-blue opponent process cells that Hering had pro- 
posed, using procedures based on colors' appearances and not just on their 
discriminabilities (Figure 9A). 

They "titrated" the response that each of these hypothetical red-green 
and blue-yellow opponent pairs makes to wavelengths throughout the spec- 
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1 »GURE 9 Accounting for color appearance by opponent-process networks. The func- 
tions of the Young-Helmholtz theory in Figure 4H explain why three wavelengths suffice 
to match any color, but do not explain color appearance. Hering had proposed two 
kinds of units, one that responds with a red sensation to some parts of the spectrum 
and with a green sensation to others, and a second that responds either blue or yellow. 
These units, by their combined activity, would account for the appearances of all hues. 
Hurvich and Jameson (1957) charted the amount of these components in the appearance 
of each section of the spectrum (see text), suggesting the functions in (A) as the response 
curves of the two kinds of unit, and suggesting a simple network (B) to encode the 
responses of the three kinds of cone (a, p, -y) into the opponent process hues plus black 
and white (Hurvich and Jameson, 1974). Anticipated and guided by these analyses cf 
perceptual experience, opponent process cells have been identified and studied by 
neurophysiological means (Svaetichin, 1956; DeValois, 1968). 



to what may be thought of as sine-wave gratings of a particular frequency 
(Blakemore and Campbell, 1969), to disparities in th two eyes' views 
(Barlow et al., 1967), etc. Even though the Helmholtzian model (Figures 
4-6) may be the simplest, we must conclude that it does not accord with 
the neurophysiological facts. 

These new neurosohysiologtcal structures raise two questions: How do 
they themselves work, and what perceptual functions do they serve? 

With respect to how these structures work, they are widely believed to 
result from the activities of suitably interconnected network? of lateral 
inhibition and excitation (von Bekesy, 1960; Ratliff, 1965), like the sketches 
in Figures 8 and 9: Jiis was very much what Mach and Hering had speculated 
to be the case. 

With respect to their possible perceptual functions, such pattern-sensitive 
networks open the way to very different kinds of explanation of the per- 
ceptual process. One of these is that the visual stimulus is analyzed into 
fundamental elements that do a great deal of what had been considered the 
task of learning and of unconscious inference. Three examples (hat have 
been given a great deal of attention will be mentioned and must stand for 
a larger number of such proposals. The first is that our visual world might 
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be assembled out of such fragments as edges and corners, providing a sort 
of feature list of which all scenes must be composed. The physiological 
mechanisms for such analyses could be provided by receptive fields in the 
striate cortex of the brain that are responsive to lines of a particular ori- 
entation anywhere within a local region of the retina (Hubel and Wiesel, 
1962, 1968), wu . cells in the inferotemporal cortex responding to primitive 
shapes — or even faces — as stimuli, independent of position cr orientation 
(Gross and Mishkin, 1977; Perrett et al., 1982). 

A second class of alternatives is to find neural structures that respond 
directly to specific properties in sensory stimulation that are themselves 
directly correlated with the distal, physical properties of objects in the world. 
Thus, cells that are sensitive to a disparity in the two eyes' images might 
provide a visual mechanism (Barlow et al., 1967) that is directly sensitive 
to an object's distance, as Hering originally argued. This possibility can 
be entertained, however, only to the very limited degree (see Gogel, 1984) 
that binocular space can be considered in such point-by-point fashion; in 
general, we must deal with extended patterns of stimulation and therefore 
with spatially organized and extended neural mechanisms. 

Spatially organized and extended neural structures are exemplified in a 
third class of alternatives that is based on the following idea of spatial- 
frequency channels: A sine-wave grating is a set of dark and light bars in 
which the intensity of the light varies in a sine wave. The width of the bars 
in such a grating defines its spatial frequency (i.e., the number of bars or 
cycles per degree of visual angle): high spatial frequencies mean fine detail, 
and the light-to-dark ratio (or contrast sensitivity) needed to discern the 
bars of each frequency characterizes the acuity of the visual system in terms 
that are compatible with those used to evaluate television transmissions and 
displays (Schade, 1956). 

But such spatial frequencies are more than just a uceful engineering 
measure. Because the rings of lateral inhibition that surround each stimu- 
lated point in the peripheral visual system come in different sizes, cells in 
the visual system are differentially responsive to oatial frequency. Cells 
have been found in the cortex that respond ele^. ophysiologically to a 
particular range of frequencies within a lestricted range of orientations in 
the retinal image (Movshon et al., 1978; DeValois et al., 1976). Moreover, 
in rough correspondence to these facts, viewers' abilities to detect com- 
binations of sinusoidal gratings (Campbell and Robson, 1964; Graham and 
Nachmias, 1971), and the aftereffects of exposure to a particular grating 
(Blakemore and Campbell, 1969; Pantle and Sekuler, 1968), both suggest 
that different spatial frequencies are being processed by separate channels. 
The relationship between such channels and the physiological finding of 
specialized response is not clear, nor is it clear what perceptual function, 
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if any, they serve. They have been proposed as the fundamental units of 
analysis of the patterned retinal image: Analogous to the channels of color 
in Figures 4 and 9, channels of differing rpatial frequency and orientation 
might perform what amounts to a two-dimensional Fourier analysis on the 
retinal image (Campbell and Robson, 1964; Ginsburg, 1971; Kabrisky et 
al., 1970; see Graham, 19S1). They have also been invoked by many 
researchers to explain a variety of phenomena in form and motion percep- 
tion, but their actual role in the perception of objects and events remains 
in question (see recent reviews by Braddick et al., 1978; Cavanagh, 1984; 
Foster, 1984; and Graham, 1981). 

Many of the present studies searching for the mechanisms of sensory 
analysis depend on the use of microelectrodes, but units of sensory analysis 
much like these had been investigated long before the microelectrode was 
developed. For example, by showing that prolonged exposure to a particular 
stimulus event provides the kind of aftereffect that one would expect to 
find if a receptor were depleted or 4 'fatigued' ' by that exposure, an argument 
couH be made for the sensory nature of the response to the event. Thus, 
after exposure of the receptor to a set of horizontal stripes moving contin- 
uously downward, a stationary set of such stripes appears to move upward, 
supporting the argument that the perception of movement rests on a direct 
sensory response to motion (Wohlegemuth, J9H). This method has pro- 
liferated in recent years (see Graham, 1981;Harris, 1980), but such findings 
can be interpreted in other ways, and the search for new sensory units 
received greater legitimacy from the neurophysiological findings. 

If wc change what we take to be the units of sensory analysis, then what 
we attribute to more central processes must in general change as well. Of 
greatest theoretical significance are those sensory mechanisms whose output 
remains invariant even though the local stimulation at each point on the 
retina may vary, i.e., mechanisms that respond to aspects of the stimulation 
that covary directly with the physical properties of objects and events. For 
example, the frog's retina contains cells that respond not to the intensity 
of light in some part of the retinal image, but to the ratio of intensities of 
surrounded and surrounding regions (Campbeil et al., 1978). As has been 
realized since Hering and even Helmholtz, that ratio remains invariant 
regardless of changes in illumination — as long as b^th regions are equally 
illuminated — so that as sketched in Figure 10 equal ratios of luminance in 
the proximal stimulation (P.S.) signify equal ratios of reflectances between 
the object and its background as distal stimuli iD.S.). It has been argued 
therefore that our perceptions of lightness are responses to adjacent ratios 
of luminance (Wallach, 1948). Such mechanisms might explain the con- 
stancies directly, that is, no additional process of computation or inference 
need be postulated. They therefore make possible very different explana- 
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then direct response to an object and its 
background whose reflectances stand in 
the ratio R l /R 2 would remain constant 
regardless of changes in illumination, E. 



adjacent ratios of luminance (I^/I^), 



alternative explanations of perception, 
very different from the classical theory, 
become possible. For example, if there 
are networks directly responsive to 



FIGURE 10 A direct response to an 
object's reflectances. Given that visual 
neurons are organized into networks, 
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tions of how a given visual attribute of objects (color, size, form, distance, 
velocity) is perceived, explanations that need not draw on speculations either 
about learning or about computation. 

Indeed, because such proposals are useful as perceptual theories only 
insofar as they identify some aspect of stimulation that "specifies" (i.e., 
that is highly correlated with) some object property, they need not even be 
concerned with neurophysiology. The search for such directly informative 
variables of stimulation therefore actually antedates the neurophysiological 
discoveries (Gibson, 1950) and remains an influential approach today. 

The most sweeping and radical proposal of this kind is a direct theory 
for all of perception (Gibson, 1966, 1979): Our nervous systems 4 4 resonate tt 
to stimulus properties that remain invariant when the light at the eye un- 
dergoes transformations (e.g. , the optical flow patterns and motion parallax, 
Figure 1 1) due to relative motion between the viewer and the objects being 
viewed. 

This is of course very different from the traditional approach. The latter 
posed the original perceptual problem as this: How are we to account for 
the objects and layouts we do in fact perceive, given thai the light at the 
eye is ambiguous and can be provided by very different surrogates? And 
it solved that problem by appealing to associations and computations that 
the individual perceiver has learned from experiences with the world. To 
the earlier direct theories that opposed this answer (including those of Hering 
and Mach) and that aimed at explaining particular perceptual abilities, 
evolution has provided specific mechanisms that so constrain the viewers 
responses that they will usually be the correct solution. Some of the newer 
direct theories seek a much more general principle and are therefore not to 
be identified with some specific physiological mechanism. 

The "invariance" principle is the most general explanation of this kind 
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FIGURE 1 1 Information about layout provided by motion. The views that an observer 
moving from point 1 to point 3 in A would have of three fixed posts i, ii. iii are shown 
in B. The motion parallax in those views provides information about the objects' spatial 
layout and sizes. For example, although the same objects at different distances provide 
images of unequal size, and arc displaced by different amounts (vectors iv, v in B3), 
the ratio of image size to parallactic displacement should be invariant. Gibson (1951, 
1966) has emphasized several ways in which the changing pattern of light to the moving 
observer, such as the optical expansion patterns in C, provide potentially usable in- 
formation about spatial layout and offer invariants that, if responded to directly, might 
explain the perception of distal object properties. 

to have been offered: Most objects and parts of the environment do not 
themselves change in form (as smoke or fog do), i.e., are rigid. When 
applied to these cases, the invariance principle means that we perceive those 
unchanging, rigid shapes and layouts in the world that project the changing, 
nonrigid two-dimensional patterns of light to the eye. This assumes that 
our nervous systems perform the required "reverse projective geometry" 
(Johansson, 1980), and that wherever the projected light at the eye permits 
a rigid source to be perceived, it will be. 
Because such theories can only account for perception obtained by mov- 
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ing observers, they take the perception of still objects and pictures to be a 
special case, governed by special and unknown principles (Gibson, 1951, 
1979; Johansson, 1980). In this view, normal perception occurs only when 
an observer moves about in a natural environment; research done in other 
situations is artificial and therefore misleading about the nature of our 
perceptual systems as they have evolved. 

Both within and outside of this approach, this rigidity principle has 
recently become quite popular. Directly or indirectly (in the form of the 
assertion that we perceive the invariant), objectwide or more locally, a 
rigidity principle has been adopted by many psychologists (Gibson, 1966, 
1979; Johansson, 1977, 1980; Rock, 1983; Shepard, 1981; Todd, 1982) 
and computer scientists (Marr, 1982; Ullman, 1979). There are at least 
three reasons why this principle is theoretically attractive. Exploring those 
reasons, and why the rr'e must nevertheless be rejected in any strong form, 
will provide a convenient survey of a critical part of the present landscape 
of perceptual inquiry. 



It is easy to see how learning by association might invest specific patterns 
of stimulation with specific perceptual meanings, and to speculate about a 
neurophysiology basis for such associative learning, but it is harder to 
be specific about a learning process through which abstra.: rules might be 
learned. (This is the distinction, made earlier, between 4t lookup tables" 
and an inference or computational process that solves some internalized 
formula). Criticisms of the classical theory are often simply demonstrations 
intended to show that perception is determined by rules rather than by 
familiar associations, rules that operate without, or even against, familiar 
patterns. 

This was the central thrust of Gestalt theory, which mounted a serious 
challenge fr > Helmholtzian theory between the two world wars — to find 
such rules, and from them to deduce the nature of the underlying brain 
processes. These rules, called the "laws of organization," were held to 
determine whether we will perceive some object at all (Koffka, 1935; 
Kohler, 1929). Fig re 12A is a demonstration of the "law of good contin- 
uation": a familiar number is concealed in i — but not in ii and iii — because 
the configuration in i requires us to break the unfamiliar but smoothly 
continuing shape in order to see the number. These rules were also held to 
determine whether flatness or tridimensionality is perceived (Kopfermann, 
1935). In Figure 12C, the pa'tern looks flat because the good continuation 
must be broken to perceive (1) and (2) as dihedrals at different distances, 
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FIGURE 12 Organization and its limits. Before microelectrodes showed extensive 
cross-connections to exist, similar interaction had been postulated by Gestalt theorists 
to explain "laws of organization" as demonstrated in (A) and (B). 

(A) Good continuation: a number is concealed in (i) by the smoothly continuing 
lines that embed it (ii), but not by mere clutter (iii). (B) Gestalt factors in conflict. In 
(i), we perceive a sine wave crossing a square wave, against factor of closedness, 
which would otherwise yield the perception of closed shapes \n). (C,D) By the minimum 
principle — that we see the simplest organization — (C) looks flat and (D) looks tridi- 
mensional because (C) is simpler as a flat pattern than (D). 

(E,F). The evidence is against such global organization. While you gaze at inter- 
section (1) in (E), the vertical line soon appears nearer than the horizontal, which is 
inconsistent with the simple figure fixed by intersection (2), and does so even with a 
moving, tridimensional cube (Peterson and Hochbe^, 1983). (F) An impossible, yet 
apparently tridimensional, picture. 
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but Figure 12D looks tridimensional because the dihedrals would hav. to 
be broken at (1), (2), etc., for the pattern to look • a set of cl .se-* 
polyhedra. 

On a practical level, such demonstrations remind us thai we cannot 
assume that a picture wili be comprehensible if only it is an accurate 
projective surrogate (as in Figure 1) and as long as the object represented 
is itself a recognizable and familiar one. Every amateur photographer learns 
that flowerpots and lampposts lurk in the background, ready to appear in 
the picture looking very much as though they are growing out of the sitter's 
head. And any text on protective coloration shows the striped tiger or zebra 
disappearing into its cluttered background. 

On a theoretical level, such demonstrations have been used to argue that 
associative learning does not determine perception: In Figure 12Ai specific 
familiarity is overcome by what seems to be an abstract confrgurational rule. 

The literature contains a large number of Gestalt rules, but each is sup- 
ported only by a few unquantified and untested demonstrations. Nor have 
the rules been used to explore brain processes. But they do appear to be 
of the utmost importance inasmuch as they seem to determine what shape 
or object will be perceived. Because several Gestalt rules usually apply in 
any real case, however, and because they will as likely as not work against 
each other, they are not of much use in their present state, lacking quan- 
titative measurement and with nu combinatorial rules of any kind. It is not 
true, as some computer scientists and neurophysiologists have claimed, that 
these rules have been abandoned because they were inherently subjective 
and unverifiable (e.g., Marr, 1982). They stand neglected rather than aban- 
doned. The fact is that, until recently, only a handful of scientists were 
concerned with the problem of organization, and they were deflected by 
two more promising lines of attack on that problem which seemed to offer 
themselves in the 1950s. 

The Promise of a Minimum PrincipU To make the insights of Gestalt 
psychology scientifically or practically u: efiil we need either a great deal 
of quantitative and object measurement of the strengths of the different 
rules, along with an appropriate combinatorial principle, or some equally 
quantitative and objective overarching rule that supplants the set of indi- 
vidual rules. For the la^r purpose Gestalt psychologists offered a minimum 
principle y i.e., that we perceive the simplest organization — the simplest 
alternative object or arrangement— that fits the stimulus pattern (Koffka, 
1935). Attempts <vere initiated in the 1950s (Attneave, 1954, 1959; Hoch- 
berg and MacAlister, 1953) to formulate an objective minimum principle, 
one that would require no intuitive judgments in order to apply it. It would 
rest instead on measuring each of the alternative objects that could fit the 
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stimulus, to decide which alternative is simpler (e.g., number of dihedrals 
or edges, number of inflection points, etc. [Hochberg and Brooks, I960]). 
With an objective and quantitative rule of this sort, a computer could in 
principle assess any picture before displaying it, and then select for display 
only those views for which the object to be represented is in fact the simplest 
alternative (e.g., Figure 12D rather than 12C). 

Although no computer programs that would apply these principles to 
image generation have actually been attempted to my knowledge, devel- 
opment of this approach continues today (Buffart et al., 1981; Butler, 1982; 
Leeuwenberg, 1971), and it has recently been applied as well to the per- 
ception of simple ambiguous patterns of moving dots (Restle, 1979). Such 
research would be theoretically important if a minimum principle were in 
prospect and practically important even if all it did was contribute to solving 
the problems of object representation. But both its theoretical and practical 
meaning must be questioned in view of facts that have been known to 
perceptual psychologists for decades. These facts tell us that stimulus mea- 
sures alone cannot provide a general explanation or prediction of object 
perception. This will receive increasing stre r s in the balance of this paper. 
Here we note that in Figure 12E (p. 266), the place that one attends de- 
termines how the "tject is perceived: when one attends intersection (2), 
the cube is so perceived that the vertical edge is the nearer, in accordance 
with both the rule of good continuation and with any simplicity principle; 
when one attends intersection (1), the perspective soon reverses, against 
the good continuation at the other intersection and against overall simplicity 
(Hochberg, 1981). 

Both real and pictured objects exhibit this phenomenon (Gillam, 1972; 
Peterson and Hochberg. 1983). These demonstrations introduce us to the 
fact that the viewer's attention, and not merely the measurable pattern of 
stimulation, helps determine what is perceived. (We will return to this point 
shortly.) With respect to tht> minimum principle, Figure 12E is completely 
incompatible with any rule based on the entire object. On the other hand, 
it is not evident how a minimum principle based on separate parts of an 
object can even be formulated and tested. In any case, no advocate of the 
application of the minimum principle to entire figures has yet attempted to 
deal with this problem, despite the fact that it was clearly Kplied by 
discovery of the famous "impossible figures" by Penrose and Penrose in 
1958 and by their popularization in the graphic art of Maurice Escher. The 
object in Figure 12F (Hochberg, 1968) appears tridimensional and contin- 
uous, even though careful inspection of the two sides shows them to be 
inconsistent. If the distance between left and right sides is made very short, 
the figure then becomes flat, and the inconsistency more evident, although 
the minimum principle is then no more or less applicah 1 *. 
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Let us hsxt consider the other factor that deflected attention from the 
objective study of organizational principles, the assumption that "iey applied 
only to stationary drawings. 

The Doctrine That Event Perception Is Both Fundamental and Veridical 
As Leonardo noted in the fifteenth century, a two-dimensional picture 
cannot provide a moving viewer with the motion parallax that would be 
provided by the three-dimensional scene it represents. As the viewer moves, 
nearer objects in a three-dimensional scene are displaced more in the field 
of view than are farther ones (Figure 13). Because the spatial relationships 
between the parts of the flat picture all remain fixed, the picture is no longer 
a surrogate for the scene. The relative motions produced by a given dis- 
placement are (with certain constraints or assumptions) specific to the layout 
of the points and surfaces of the scene in space. The differential motions 
within the stimulus pattern offered by the scene provide the moving observer 
with rich information about the structure of the world. A critical question 
being explored today is how much of that information is used, and in what 
form. 
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FIGURE 13 As Leonardo knew, pictures are not surrogates for a moving viewer. In 
A, a viewer moves from x to y. If the display is a picture, all parts are displaced equally 
in the field of view B, but if it is a window, objects at different distances undergo 
different parallax C. Those who take the invariants of the moving stimulus array to be 
fundaments! to perception (sec Figure 1 1 ) have yet to explain how ; t if that we perceive 
pictures. 
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The precision and ease with which we can study viewers' responses to 
moving patterns, and to other stimuli that change over time, depends of 
course on the equipment with which such stimuli can be produced and 
presented. Un il the early 1950s only simple mechanical and electricai 
devices were in general , :e. Since then the dissemination of relatively 
cheap 16-millimeter motion picture cameras capable of producing controlled 
motion through animation, the advent of even cheaper and more convenient 
video er- * ; nment, and, above all, the availability of computer-generated 
displays, have progressively revolutionized the study of patterns that change 
with time. We are now in the midst of an explosion of research on the 
topic, done as much by computer scientists, physicists, and neurophysiol- 
ogists as by perceptual psychologists. 

Even the earlier and more primitive apparatus contributed a wide array 
of facts, some of which have been neglected in the recent interdisciplinary 
renaissance. Much of the earlier research was not dii zctly addressed to 
questions of object perception but was intended instead to explore basic 
processes, e.g., the study of ihe time constants of the visual system's 
responses to flicker (Kelly, 1961), or the study of the conditi' ns that yield 
apparent movement with successive simp'*, static stimuli (Braddick, 1980; 
Kolers, 1972; Korte, 1915; Morgan, 1980). Some of the facts obtained in 
such research address the question of whether (and how well) our nervous 
systems respond to the stimulus changes that carry information about depth 
and motion (Figure 14). We know, for example, that our visual systems 
are extremely sensitive to motion parallax: even a very slight difference in 
distance between two aligned or nearby rods (Berry, 1948) and a very small 
head movement on the part of the viewer will provide a displacement in 
the retinal image that should be detectable (Helmholtz, 1866; Wheatstone, 
1 839). If two objects at different distances happen to line up from a particular 
view, therefore, and good continuation then provides a misperception of 
the object (as in Figures 12Ai [see p. 266], 14Bi), even a slight head 
movement should provide a detectable break in the good continuation. 

Moreover, the two-dimensional shadows or projections of irregular spatial 
arrangements of :ods, or of dots distributed in space, or of unfamiliar objects 
(Figures 15 A, B, C, respectively), Jacking other depth cues so that they 
are perceived as flat arrangements when stationary, are perceived as three- 
dimensional layouts when they are set into motion. Even more than the 
static Gestalt demonstrations, these phenomena seem difficult to explain as 
the use of a lookup table, learned by association, that the viewer can consult 
to determine the meaning of some previously encountered set of sensory 
events: How plausible is it that the viewer has encountered the particular 
pattern of moving randomly arrayed dots, shown in Hgure 15B, so often 
that by familiarity ;t has become a recognizable tridimensional l. rangement? 
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FIGURE 14 Motion parallax, binocular parallax, and their effects on adventitious alignments. At A, as the viewer moves from l to ii t 
with gaze fixed on iii, the view changes from iv to v. At B, if the "4" happens to be adventitiously in perfect alignment with the ends 
of the open loops in the background, a slight head movement to the right ii or left iii will provide a misalignment; this presumably should 
make good continuation (Figure 12A) inoperable except in static pictures. In C, without head movements, parallax is provided by the 
two eyes* views. (R, L arc right and left eyes; V R , V L arc their views.) 
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ABC 
FIGURE 15 Structure through motion: Precomputer methods of studying motion per- 
ception. Unfamiliar patterns that look quite flat when static appear tridimensional when 
put in motion, encouraging the formulation that we perceive the rigid (or invariant) 
object that would provide me changing stimulus pattern. Some well-studied older er 
amples are illustrated. A: Ths shadows of rods on a rotating turntable i, or the rods 
themselves, are viewed through an aper ire that occludes their ends (Metzger, 1934; 
White and Mueser, 1960). B: Tne shadow of a set of dots on a moving glass plane i 
is projected on a screen ii (Gibson and Gibson, 1957). Such displays were initially the 
easiest to program and study in computer-generated form (Green, 1961). C: Simple 
unfamiliar wire forms mounted on a turntable provide a "kinetic depth effect" (Wallach 
and O'Connell, 1953). 



It seems far more plausible that the phenomenon is the expression of a 
perceptual rule. 

We have seen that we can in fact use relative displacement to discern spatial 
structure. But that still leaves cpen the question of what the rule is by which 
we fit fliree-dimensional space to the two-dir.iensiortal but moving stimulus 
Pattern. As we have seen, the simplest and most general solution is that we 
extract that invariant object or layout that will fit the moving stimulation 
(Gibson, 1979; Johansson, 1980). This rule would account for the perception 
of rigid objects and surfaces without additional rules or constraints. Moreover, 
it includes the perception of motion pictures, and the phenomena represented 
in Figures 13 through 15, under the same g ,eral explanation. 

As computers have made it easier to generate pictures of points moving 
in space, and as more research is done with such patterns, the point first 
made by the Gestalt demonstrations — that perception is governed by rules 
rather than lookup tables— has taken hold. And although Helmholtz and 
the earlier psychologists to whom perception is the re^l' of learning often 
talked of what amounts to perceptual rules, no formal account has been 
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offered of specific mechanisms or principles by which such perceptual 
learning might occur. However, once a rule is explicated with precision, 
it becomes relatively easy to imagine neural circuitry that might underlie 
its working. There is therefore added incentive today to take a "nativist" 
stance — to propose explanations of perception that depend on innate pre- 
wiring rather than on learning processes. 

As we will see, the design of explanations in terms of such neural circuitry 
is a very active enterprise today, particularly in the field of computer 
science. But if that effort is to apply to human perception, it must start 
with perceptual rules that indeed are used in the human perceptual process. 
We still must decide what those rules are. 

The perceptual rule that is most readily and explicitly defined in physical 
terms is the currently popular rigidity principle. In fact, however, the strong 
forms of the rigidity principle will not work, for the perception either of 
objects in space or of their representations. Evidence that decisively refutes 
the strong form of the principle includes findings obtained many years ago, 
although the implications of these facts have not been adequately taken into 
account in most recent discussions. The same facts make the other over- 
arching principles, as they are presently conceived, equally unworkable. 
We survey some of that evidence next. 

Some of the points that follow have recently been made as well by 
Brainstem (1983), by Gillam (1972), and by Schwartz and Sperling (1983). 

Why the Strong Forms cf Various General Perceptual Principles Must 
Be Rejected A*vhough the overall case against these general rules cannot 
be reviewed here in detail, the strongest argument is simple and sufficient: 
Even when rigid moving shapes are in full view, we do not necessarily see 
them. In some cases we perceive instead quite different shapes undergoing 
nonrigid deformation. 

This has been known in a general way at least since 192? (e.g., von 
Hombostel found that a real, rotating wire cube reverses perspective even 
though it must then appear to stretch and bend), and a remarkably robust 
illusion known as the Ames "window" has been widely disseminated since 
1951: A trapezoid (often with shadows painted on it to "suggest" the 
perspective view of a window) rotates continuously in one direction (e.g., 
arrow vii as seen from above in mirror, M) either clockwise or counter- 
clockwise, in full view, as shown in Figure 16A. It is not seen as such. 
Instead, it is perceived as oscillating (arrow viii), reversing direction twice 
each cycle so that the larger end (i) (or iv in the mirror) always appears 
the nearer. It is as though a process of unconscious inference were at work, 
assigning depth on the basis of the static depth cue of linear perspective 
(Figure 2, see p. 251) and inferring direction of movement from relative 
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FIGURE 16 A classic illusion with a moving object. In A, a flat trapezoid, with 
.narkings painted on its surface to "suggest" depth, is seen from in front, with i aud 
iii equidistant from the viewer, and from above in the mirror M. At B, the trapezoid 
has a ^ so that the small edge iii is nearer the viewer. Ames (1951) found that 
though rotaung continuously (arrow vii in the top view) it appears to oscillate back and 
forth (arrow viii); see M. Although the rod, ii, is rigidly fixed to the trapezoid, h is 
correctly seen to rotate, passing through the substance of the trapezoid! The trapezoid 
cannot both appear to oscillate and yet remain rigid in appearance. The solid and dotted 
outlines in C are its shape as presented to the eye when edge iii or i is respectively the 
nearer. If seen to oscillate, the trapezoid must also appear to deform between these 
shapes, as shown by the arrows, although this nonrigidity is not normally very noticeable. 



depth. I hasten to add that although it is widely offered (e.g. t Ames, 1951; 
Gibson, 1979; Graham, 1963; Hochberg, 1978b) there is no experimental 
support for such an explanation of the phenomenon; indeed, there are 
features of the changing retinal image that might be direct, if misleading, 
bases of the illusory response (Braunstein, 1976; Hochberg, 1984b). (For 
example, even when the larger end swings away from the viewer, as shown 
by arrow vii in Figure 16B, a vector of expansion, ix t will generally be 
provided as the large end swings in toward the axis of rotation, ?jh! ex- 
pansion is normally a correlate of approach; cf. Figure 1 1C, p. 264.) 
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This illusion is extremely strong and difficult to overcome even when 
the viewer is confronting a real object (although in that case, one eye must 
be kept covered at close distances; at really close distances the true shape 
and movement may be seen monocuiarly as well). When the viewer con- 
fronts a moving picture, rather than the object itself, the illusion is almost 
irresistible. The virtual shape that then fits the perceived illusory movement 
to the changing pattern of light at the eye must then be nonrigid (Figure 
16C), and the percewed path must follow a complex and changing radius. 
Moreoever, the illusory 180-degree oscillations of the trapezoid are per- 
ceived even if a rod is rigidly affixed to the trapezoid, as in Figure 16A 
and B; the rod does not appear rigidly fixed, but pursues its 360-degn:5 
rotation, apparently passing through the trapezoid like a phantom when the 
trapezoid reverses its apparent course. (This is true even if the viewer is 
simultaneously shown the setup from above in a mirror, the rod and trap- 
ezoid are seen to rotate 360 degrees as a rigid unit in the minor, and, at 
the same time, to move in separate pans in direct view.) 

Thus, a truly rigid and invariant object, moving in a simple and invariant 
orbit, is not perceived, and instead a nonrigidly deforming and quite illusory 
object is seen, moving in a complex and variable path. 

This phenomenon, although widely popularized since 1951, has been 
virtually ignored by those who propose that our perceptions are determined 
by invariance, rigidity, or simplicity principles. I can find only brief mention 
of the phenomenon (Gibson, 1979), claiming that it occurs only when the 
motion-provided information is below threshold, and that then the illusion 
rests on unconscious inference. This highlights the question of thresholds, 
which must surely be considered before v z can say that any of the motion- 
produced information discussed in connection with Figure 1 1 (see p. 264) 
provides anything useful to the viewer, and which has yet to be addressed 
in any systematic evaluation of the direct theory (Cutting, 1983; Hochberg, 
1982). Moreover, by invoking unco.iscious inference, this way of dealing 
with the phenomenon spoils the direct theory's claim to parsimony. But in 
any case, that answer is wrong. Even when the changes provided by tie 
moving object are clearly above the C 'lection threshold and the illusion s 
therefore accompanied by clearly perceived nonrigidities, the latter is what 
we see, and not the veridical rigid motion (Hochberg, 1984b; Hochberg et 
al., 1984). 

Subtle argum *nts are not needed, however Given the lessons of Figure 
16, we can reaaily devise new illusions in which rigid simply moving 
objects, freely viewed (with monocular vision), are seen to bend and deform 
nonrigidly, as in Figure 17. Motion is not enough to ensure veridicality, 
therefore, and what is perceived may be perceived against any effects that 
simplicity, invariance, and rigidity principles might exert. 
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(i) 



(ii) 



A 



B 



HGURE 17 Apparent bending in a rigid, moving objcc:. A flat, rigid octagonal cutout, 
with markings painted on its surface to "suggest** depth, is shown in front view at A 
and from above at B. To mom rular vision, when it moves as shown in Bi, it tends to 
appear instead to hinge in the middle, and to "flap** away from the viewer as shown 
in Bii. Similar but less compelling effects occur without the markings, and with oval 
shapes as well (Hochbcrg and Spiron, 1985). 



COMPUTERS AND PERCEPTUAL PSYCHOLOGY 

The microeleclrode was a major technological watershed, and its e'/feos 
were quickly manifest. The introduction of the computer has had far greater 
effects, but they have been more diffuse, are slower in being realized, and 
are still growing, as computer science and technology change. 

There are six main ways in which the computer has affected perceptual 
psychology; although these ways are closely intertwined, they are also very 
different, and it is important »o separate them if one is to understand the 
relationship between the two disciplines. 

The first two uses are contributions that computers now offer every branch 
of science: obtaining and analyzing data, and modeling theories and ex- 
planations. 



The computer has of course radically changed the methodology of mea- 
surement and analysis. For example, the direction and changes in the sub- 
ject's gaze can be monitored and even used to control the display that 
confronts the eye (McConkie and Rayner, 1975), permitting the detailed 
study of how the integration of successive glances occurs in the process of 
reading text and perceiving pictures. Such research would simply have been 
impossible without high-speed and powerful computers; we will see that 
the problem to which this method is addressed is of central importance. By 
handling large quantities of numbers and rapidly executing operations that 
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were once prohibitively time-consuming and expensive, the computer makes 
available data that were in essence unobtainable. This is true both of phys- 
iological data and of judgments intended to tap perceptual experience. Thus, 
physiological signals that are normally far too weak to be distinguished 
from accompanying bioelectrical background noise can be adumbrated by 
computer methods that cumulate them over marv occurrences, making it 
possible to measure the electrical potentials (Donchin et al., 1978; Sutton 
et al., 1965) and magnetic fields (Kaufman and Williamson, 1982; Reite 
and Zimmerman, 1978) at the scalp. Such averaged transients reflect neural 
responses that the brain makes to sensory stimulation and that accompany 
perceptual processing. 



The second use of the computer, common to all science, is to model 
theoretical proposals and explanations for which it would otherwise be 
impossible or too laborious to say whether and how they would work. 
Whether some hypothetical neural network would respond as designed 
(Hebb, 1949; Mair, 1982; Rashevsky, 1948; Rosenblatt, 1962), whether a 
particularly defined set of flow patterns (like Figure 16C) would specify 
uniquely a set of surface forms in the world (Ullman, 1979), whether a 
particular history of strengthened associations would even theoretically re- 
sult in perceptual learning (Hebb, 1949; Minsky and Papert, 1069; Rosen- 
blatt, 1962) — these are questions that cannot be answered simply by 
considering them in verbal form but that can often be answered once the 
functions are stated specifically enough to run as a computer program. 



A second branch of computer science aims at embodying perceptual 
functions, similar in effect to those of humans, in computer hardware and 
software. We must distinguish two distinct purposes that gnde this enter- 
prise. One is to design and provide devices that can serve instead of hu~ians. 
Some of these functions are readily achieved (the senso.s that open super- 
market doors, the bar-code scanner that identifies and prices items at the 
checkout counters), and some are probably unachievable in the foreseeable 
future (e.g., machines that respond to or translate free and normal human 
discourse); but in general there is no compulsion to s*:ve each function in 
the same way »hat humans do. Human perceptual functions here serve only 
as "existence proofs that assure the computer scientist that at least one 
way of solving the problem exists and is embodied in human neuroanatomy. 

Once we start to consider the means which modern electronic com- 
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puters might perform such tasks, however, we develop new ways of thinking 
about how the human nervous system performs its perceptual tasks. The 
computer then serves as an analogy or even a model for the study of human 
perceptual processes. That may turn out to bf the most important relationship 
of all between computer science and perceptual psychology, and we consider 
that next. 

The Computer as an Analogy to Perception 

Perhaps the greatest effect of the computer has been its influence as an 
analogy: Inherently vulnerable to entrapment in the mind-body problem of 
philosophers and metaphysicists, and self-conscious about the need to be 
scientific, psychology is always tempted to confine its attentions to variables 
that arc conceived and measured in physical terms. Indeed, almost since 
J.B. Watson's behaviorist manifesto in 1913, physical measurement and 
physical (or at least physiological) conceptions have enjoyed intellectual 
hegemony in this country. 

There was of course continuous opposition, both on scientific and me- 
taphysical grounds, and the field of perceptual psychology by its very subject 
matter was less constrained by behaviorism than other fields of psychology, 
but for that very reason it was almost abandoned as a discipline for some 
two decades. It was not until the late 1950s that what can only be called 
"mental" conceptions and measures once again became scientifically re- 
spectable to the rank and file of the profession. I am convinced that the 
main factor in this change was the obvious fact that computer programs 
are in principle transportable to very different physical machines. They can 
therefore be analyzed and discussed in abstract functional terms without 
reference to the specific hardware in which they must be embodied to 
perform. Familiarity with computer functions, terminology, and flow charts 
made it possible to describe what the mind might be doing in a way that 
could, n> principle, be instantiated in a program and then embodied in a 
machine (Miller et al., 1960; Rosenblatt, 1962; Selfridge, 1959). 

Something like this had already been done repeatedly, long before com- 
puters v/ere developed, from Descartes' design in 1650 of a hydraulic model 
underlying neural function, to Tolman's analysis of purposive learning by 
a "schematic sowbug" in 1938; but there was never any real likelihood 
that the analyses might be put to the test by building the machines. The 
general-purpose computer and transportable programs have made the point 
much more powerfully. 

The language of cognitive psychology is now very close to the language 
of computer science. There is usuallv no guarantee that any given flow 
chart with which e cognitive psychologist offers to explain some phe- 
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nomenon can in fact be translated into an executable program, but if not, 
it is inadequate because it is vague or inconsistent and not because it is 
mentalistic. 

Computer Science in Perceptual Psychology Research 

The attempt to design machines that embody human perceptual functions 
(or to design programs that model such machines) rests on the belief that 
only in this way can we be sure that we have achieved a scientific under- 
standing of those functions. This is an old belief and undertaking, but the 
advent of the modern computer makes the venture seem more plausible. 
Given its purpose, this undertaking must start with scientific empirical 
knowledge of how humans perceive. That is of course precisely what the 
task of perceptual psychology has been. In consequence, the two disciplines 
now overlap greatly, and an increasing amount of perceptual (and cognitive) 
psychology is currently being done in computer science departments. 

This is a very promising development, and some of the work has received 
wide attention as a "breakiirough"; but it would be unwise to overestimate 
what has been accomplished at this early stage (see Braunstein , 1 983; Haber, 
1983). The approach ensures precise modeling of theories but does not by 
itself provide either new theories or new facts about human perception. 
The point is worth spelling out in a brief examination of the field. 

Because it is far easier to make initial progress at formulating specific 
models of direct neural response to stimulus information than at formulating 
specific models of central processes of learning i 1 inference, most of the 
work in this field has concentrated on the former iee Haber, 1983). As a 
first stage, any perceiving machine must be able to separate objects from 
their cluttered surroundings; this problem is ve**" difficult to deal with in 
still pictures (Oately, 1978, Roberts, 1965). We have seen above that the 
problem is mathematically less refractory, given the multiple views provided 
by motion parallax and binocular parallax (see Figure 14A and C on p. 27 1) 
in that fewer constraints are needed to specify the three-dimensional layout 
that would produce the stimulation at the eye. It is understandable, therefore, 
that computer scientists have recently turned to models of binocular ster- 
eopr j (Marr, 1982; Marr and Poggio, 1979) and of the perception of 
structure through motion (Marr, 1982; Ullman, 1979). 

These "computational" models are totally within the mainstream of 
perceptual psychology (although that is not always clear from their pre- 
sentation, nor from their reception). For example, the computational model 
of binocular stereopsis devised and tested by Marr and his colleagues was 
a relatively slight variation of a detailed theory published in 1970 by Sper- 
ling, a psychologist; Sperling's theory is itself well within that class of 
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psychologists' explanations of stereopsis (see Kaufman, 1974) that have 
taken Johannes Kepler's (161 1) geometrical analysis of the binocular lines 
of sight that obtain when viewing objects in space as the model of an 
internal "binocular neural field" that merely reflects that geometry. And 
Ullman's computational analysis of the information that moving stimuli 
give about their layout in space takes ^s place within the long tradition of 
such analyses and research. Neither of these computational perceptual the- 
ories can claim to be more than partial accounts of the phenomena in the 
domanp they address. For -xample, even with unimpeded binocular par- 
allax, we perceive the concave n.old of a human 'ace lit from below as a 
convex hcz : from above; even with unimpeded motion intormation, as 
we have seen (Figi *es 16-17), at I^a* 1 some rigid objects *ie perceived as 
honrigid and in wrong slant and motion. These computational theories do 
not differ frcTi other attempts at sensory explanation of object percept.jn 
in their inability to deal with such problems. They differ only in that they 
are restricted to models that can be successfully run as computer programs, 
and that is not necessarily an unalloyed virtue. 

Although computer simulation and "computer perception" have received 
considerable praise in recent years, there are grounds for criticism as well. 
The need to devise perceiving machines that work as humans do is certainly 
not a valid economic argument— one can usually find far more direct means 
of performing specific tasks. Nor is computability a necessary criterion for 
assessing any theory, regardless of how desirable that quality may be (and 
despite the stress on simulation studies currently evident in many quarters). 

But these arguments are moot. Regardless of the intrinsic merits of 
computer simulation and of the quest for perceiving machines, and without 
appeals to metatheory or philosophy of science, there remains a present 
and growing need to develop theories of human perception to the point that 
they can he embodied in computer programs. That is the relationship be- 
tween the computer and perceptual psychology that we consider next. 



Computers communicate to their human users through pictures as well 
as through words and numbers. But more than that, they are increasingly 
used specifically to generate pictures: as interfaces between the viewer and 
some part of the world that would otherwise be difficult or impossible to 
see; as means of visualizing designs of buildings, machine parts, molecules, 
chromosomes, or cellular processes; as substitutes for human artists and 
animators in creating graphic displays for advertising and entertainment; as 
simulators in flight training. The use of such devices is already great, and 
growing rapidly. In many cases, the pictures (or pictorial sequences) that 
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are displayed were not themselves programmed and were never previewed 
in any sense, but are generated in response to the question that the user 
asks or in reaction to something that the user does. An example of the 
former would be what an architectural layout or a machine part will look 
like from some chosen location, or what the state of a flow chart would be 
under some specified conditions; an example of the latter would be a low- 
altitude flight simulator display, which depends, of course, both on the 
terrain being simulated and on the individual pilot's actions. 

Without a human editor intervening between computer and user, there 
is some unknown likelihood that the viewer will be shown misleading or 
incomprehensible pictures. Where that likelihood must be minimized, the 
computer must avoid certain classes of pictures, or must be prepared to 
enrich or enhance those pictures. This means that we must be able to specify, 
in terms acceptable to a computer, how humans will perceive a pictuie or 
a sequence of pictures. 

The study of the rules of representation is now a vigorous and growing 
field in which perceptual research finds practical application (Cutting and 
Millard, 1984; Habei and Wilkinson, 1982; Stevens, 1983; Todd and Min- 
gola, 1983). Although this task shares much of what we must learn through 
exploring analogies between computers and human perception, it is also 
significantly different. It cannot ignore as mere embarrassments the cases 
in which we misperceive, the exceptions to proposed generalizations — 
indeed, it is just those cases that must be the focus of inquiry. And that is 
fortunate for psychology, because those are the cases that test the generality 
of any perceptual theory. 

A superficial answer to the question of how we can ensure comprehensible 
pictures is to increase the fidelity of the surrogate — i.e., to make the light 
to the eye more like that provided by the object or scene that is being 
represented. That means improving the resolution and the color balance, 
avoiding distortions, etc. Indeed, if other things are equal, an improvement 
in these engineering factors will usually improve picture comprehensibility. 
But we have seen that even perfect fidelity — i.e. , the moving object itself — 
may result in misperceptions (Figures 16, 17). The constraints on mental 
structure — on the structure of perceived objects — are not the same as the 
normal constraints on physical objects, and we must know the former as 
well as the latter if we are to be able to predict how pictures are perceived, 
even with the best picture quality possible. 

There are practical limits, moreover, to the pictorial information that we 
can count on. One can see detail in a closeup, or an entire object or scene 
in a long shot, but not both. Picture quality is limited, and the techniques 
that motion picture and television filmmakers have developed to cope with 
those limits — surveying or scanning an object or scene by successive partial 
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views or closeups — require the viewer to go beyond the momentary sensory 
input, and to enter and store the successive partial views in some mental 
or perceptual structure of the object. 

Perceptual structure refers to the relationships within what one perceives 
4C5araei^t^. r -l-956r Hodibwg,-1956) r that 4s, to information about the 
object that can be retrieved from the viewer— for example, that the sides 
of a cube look equal and parallel, that the vertical edge at 7 in Figure 12E 
(see p. 266) looks farther than the horizontal edge it intersects when the 
vertical edge looks the nearer at 2. To the degree that perceptual structure 
reflects the structure of the physical stimulus, physical analyses of optical 
information will serve to model the perceptual process; obviously, as long 
as we stipulate that some object, say, a wire cube, is perceived correctly, 
the layou' of the physical cube itself must serve to predict the relative 
apparent nearnesses of its parts. The simplicity of this task is of course 
what makes the more extreme direct theories so attractive. To the degree 
that perceptual structure reflects known (or hypothetical) neurological struc- 
ture, however, the latter must also modify any attempts to relate what die 
viewer perceives, on the one hand, to the optical structure of the object or 
scene that confronts the perceiver, on the other. Thus, what we know about 
the distribution of acuity over the retina or what we think we know about 
spatial frequency channels must be used in attempting to predict the effects 
of the information that could otherwise be provided by the optical structure. 
Computer models of perception can incorporate both kinds of structure with 
very little input from psychological research. 

To the degree that perceptual structure reflects none of these — that is, 
to the degree that it expresses what we may call mental structure — per- 
ceptual research must provide the facts that are needed for any theories, 
whether or not those theories are embodied in computer models. Such facts 
are obtainable but sparse; for this reason, computer science has as yet very 
little to say about the modeling of mental structure. Some terms have been 
offered (e.g., Minsky's "frames" [1975], roughly equivalent to an expec- 
tation or a schema), but terms or even models are not needed here so much 
as facts, and more attention paid to what facts we do have. We next consider 
very briefly the current state of research on mental structure in real and 
represented objects. 

MENTAL STRUCTURE IN OBJECT PERCEPTION 
AND REPRESENTATION 

Where perception can be predicted from the pattern of stimulation that 
falls on the sense organs, we are free to argue that the stimulus pattern 
itself (as transformed and limited by the sensory system) determines what 
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we perceive. Of course, we are also free to hold that there are other factors 
at work as welL The fact is that we have a long history of demonstrations 
that sensory stimulus information is neither necessary nor sufficient to 
determine what we perceive. 

One source of such evidence is provided by completion phenomena, 
examples of which are shown in Figures 18 and 19. These should not be 
dismissed as strained: In a normally cluttered world, such interrupted and 
fragmented shapes must be the rule rather than the exception. Nor can our 
perceptions of these shapes be profitably ascribed either to complex sensory 
structures (such as receptive fields and frequency channels) or to invariant 
stimulus information. The perception ; a single object rather than of sep- 
arate fragments often depends on the viewer's having specific knowledge 
of what that object normally looks like, and on being ready to perceive it. 
That is very much what Mill and Helmholtz meant by perception. A nice 
demonstration to which Dallenbach called attention in 1951 is shown in 
Figure 19A, in which few viewers can discern any clear object. After 
looking at Figure 19B, however, it is remarkably difficult not to see that 
same object when looking at Figure 19 A. 





FIGURE 18 Completion phenomena. A selection of simple geometric shapes: at A, 
a square; at B, a circle; at C, a circle, triangle, and square; at D, a cube. In fact, the 
fragments that are shown do not by themselves define or specify any shape. It might 
be, for example, that C consists of the block letters CAT, partially occluded. 
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Completion phenomena have been known to psychologists for more than 
a century (and to artists, of course, for much longer). While the classical 
theory prevailed (see Figure 6, p. 254), such demonstrations were taken 
for granted and assumed to reveal the pattern of associations (or sensory 
expectations) that each viewer has learned from experience with any object. 
Although viewers would differ in their individual perceptual histories, the 
structure of their sensory expectations or associations should nevertheless 
reflect at least grossly the covariations or contingencies of the physical 
world — as filtered through their limited sensory systems. That is, mental 
structure should be predictable from measures of physical structure (e.g., 
Brans wik, 1956), once sensory limits are taken into account. 

In some cases, mental structure does indeed seem to be at least approx- 
imately that of physical structure (e.g., the "constancies" described in 
connection with Figure 7), but we also have had dramatic counterexamples 
for the past 30 years (Figures 12F, 16A). Some psychologists still adhere 
to "unconscious inference" explanations today (Gregory, 1970; Rock, 1977), 
but such counterexamples make that proposal as it now stands an empty 
one. To mean anything at all, the premises of such supposed inferences 
must be investigated and not simply taken to be the same as the structure 
of the physical world. Moreover, as we have discussed at length, the last 
30 years have also shown that much of perceptual structure may be given 
directly by complex neurophysiological circuitry; if that is at all true, such 
prewired perceptual structure must surely affect the nature and use of what- 
ever mental structure does exist in addition. For example, for all we know 
at present the Ames trapezoid phenomenon (Figure 16) may result not from 
unconscious inference but from some direct sensory mechanism that pro- 
vides a salient illusion only in certain conditions (see Figure 16C, p. 274; 
Hochberg, 1984b). 

We need, therefore, to study mental structure and to measure its char- 
acteristics. The very topic has an aura of ^substantiality, until recently 
anathema to many psychologists eager to avoid subjectivity and mentalism. 
To study how a person perceives some object we must in one way or another 
ask him or her questions about that object — retrieve information from the 
subject about the object. In cases like the completion phenomena, we must 
ask the viewer questions about an object that is not in fact present and for 
which only that absent virtual object, <uid the few stimulus fragments ac- 
tually shown to the viewer, can be confidently described in physical terms. 
That fact is a challenge but not an insuperable obstacle. There is actually 
a considerable body of research with a much more extreme experimental 
situation: Since Galton (1883) first undertook to study individual differences 
in mental imagery, methods have existed for studying how well individuals 
can retrieve information about objects for which no stimulus information 
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whatsoever is present. Such "objective tests of imagery" (see Woodworth, 
1938) have seen increasing use in recent years, but rather than being used 
to probe individual differences in imagery, most present research is directed 
to examining the nature of the imagery process itself (Kosslyn, 1980), a 
task that faces many of the same challenges as does the task of studying 
mental structure in perception. Whether such imagery studies have sub- 
stantial implications for perception is unclear. We do not know whether 
imagery, studied with no stimuli present, is related in any simple way to 
the mental structure that is involved in the perception of partially present 
objects. That can only be answered by research on the process by which 
mental structure informs and accepts sensory information. 

The need to fit fragments of sensory information into some mental struc- 
ture is pervasive in normal perception. The perception of objects that are 
partially obscured in normally cluttered environments must often draw on 
a process of fitting fragmentary sensory information into a previously pro- 
vided mental structure (Figure 19). 

In addition, our perceptions of any scene or moderately large object must 
be assembed over time by means of successive glances, each of which 
provides only a partial view of the world. Finally , as objects are temporarily 
obscured by nearer ones (as viewer, object or both move through the world), 
we must be able to keep track of their motions even while they are out of 
sight, and to recognize them when they reappear.. Both of these functions 
are drawn upon in our perceptions of real objects in the world and also in 



FIGURE 19A A completion figure. The mysterious object shown in this high contrast 
photograph is more readily apparent in Figure 19B on page 286 (reprinted, with per- 
mission, from Da] I cn bach, 1951). 
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FIGURE 19B The same cow shown in 19A (reprinted, with permission, from Dal- 
lenbach, 1951). Once the object has been seen in Figure 19B, it is remarkably difficult 
to avoid seeing it in 19A as well. 

film and video, as cameras cut from one scene to another (both successfully 
and unsuccessfully). And both functions suggest methods by which mental 
structure may be studied. We consider these in turn. 

In the normal process of directing our gaze at different parts of some 
object, each glimpse offers detailed vision only in a small central part of 
the retina. The information gained by the successive fragmentary glimpses 
(as many as four per second) must therefore be integrated by some non- 
sensory process into a single perception. Similarly, in virtually all motion 
picture or video sequences, successive closeup views or shots each provide 
a partial view of some scene that may never be shown in its entirety (which 
would be a long shot) and may in fact not exist at all save in the mind's 
eye of the viewer. This is a kind of completion over time, of central 
importance to perceptual theory and application, that could not be studied 
at all until the last decade, when motion pictures and high-speed computer 
graphics became generally available as laboratory tools. 

There has as yet been little more research on this aspect of object per- 
ception than to show that such research is possible. The row of circles in 
Figure 20 represents a sequence of successive views that simulate a sta- 
tionary circular aperture through which the individual corners of some ob ject 
that is being moved about behind the screen — in this case, a cross — are 
visible. If the motions of the comers were themselves visible, the viewer 
could construct the entire object behind the screen in his mind's eye, de- 
tecting, for example, that a specific arm of the cross has been skipped 
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FIGURE 20 Other completion figures. A sequence of right angles, presented at rates 
of 333 to 2000 msc per view, is shown at A. Subjects who are shown two such sequences, 
which may or may not differ in one or more views near the middle of the sequence, 
cannot tell better than chance (the baseline in C) whether the sequences are the same 
or different, because each sequence of views, considered as independent items, far 
exceeds their memory span. In each pair of sequences, at least one sequence is in fact 
a systematic succession of closeups of the corners of a cross. If each sequence is 
introduced by a long shot and a medium shot, as at B, which establishes the overall 
object and the starting point of the sequence, then each view of the unchanged sequence 
takes its place in rum within the structure that the viewer has in mind, whereas the 
altered sequence does not, and the difference between the two sequences (which are 
no longer strings of independent events) becomes evident, within the time limits indicated 
ctC. 



within the sequence. If the motions are not visible, as they are not in this 
experiment, then the sequence of static views is indecipherable and in fact 
cannot be kept in mind; if a long shot of the object is presented first, 
however (as in row B), providing a mental structure within which the 
successive views can take their place, the subject can again perceive the 
object that is moving behind the aperture (Hochberg, 1978a). It is the mental 
structure of the object that makes the stimulus sequence comprehensible. 
Given the long shot and the structure it provides, tv/o sequences that are 
different are perceived as such; without the structure, the viewer cannot 
distinguish one sequence from another. 

When a pedestrian you are watching is lost from view while he passes 
behind a parked truck, or while you divert your gaze to the traffic light, 
yov must still be able to tell approximately when he will return to view 
from behind the truck, or where he will have gotten to when you look back 
from the traffic light. Such predictive functions, for which we can surely 



235 



288 



JULIAN HOCHBERG 



find ample evolutionary demands, imply that something that corresponds 
to motion through space occurs in the mind's eye of the viewer. The 
filmmaker or graphics programmer who cuts away from one event to an- 
other, and then returns to the first one, must make some assumptions about 
how well the viewer keeps track of any motion that L going on in the first 
event. The following research shows that discussing such mental motion 
is more than just a poetic metaphor. 

Shepard and his colleagues hud shown in a wide range of experiments 
that the time subjects need to judge whether two objects are the same or 
different (Figure 21 A) is proportional to the angle between them, as though 
one object were being mentally rotated at some constant rate to bring it to 
the same orientation as the other in making the comparison (Shepard and 
Cooper, 1982; Shepard and Metzler, 1971). Using that paradigm, Cooper 
(1976) first determined each subject's characteristic "mental rotation" rate, 
a), and then, after having had subjects memorize the figures, displayed the 
comparison figure at some variable angle (<f>) and delayed after a starting 
signalby a variable mterval(t). She found that if the productof<o x (t) = (<f>), 
judgment times no longer increased with angle (<J>): they were now inde- 
pendent of the angle between the two objects being compared (Figure 2 IB). 
The results are what one would expect if the object had in fact been rotated 
at angular velocity a> x (t) between presentations i and ii, and if both 
objects had come to the same orientation by the time the comparison was 
called for. 

Given these findings, "mental rotation" seems more than a metaphor 
that summarizes the fact that judgment time is a function of angle (<f>) in 
Figure 21 A. It implies a usable and consistent relationship between time 
and distance in a mental structure that cannot be attributed to physical 
stimulus information. 

A third and quite different paradigm, which appears in a recent technical 
report by Cooper (1984) on work in progress, may tell us something more 
general about the form in which perceived ejects are manipulated and 
stored and may also eventually provide a tool with which to compare how 
well different methods of representation accord v/ith the ways in which 
objects are perceived and remembered. Subjects had been given two or- 
thographic projections of an object (a and b in Figure 22) and were to judge 
whether a third orthographic projection (c) was of the same or of a different 
object. No isometric projections (e.g., c) of any objects were shown to the 
subjects at this time. Subsequently the subjects were shown a set of isometric 
projections (e.g., c, f)» some of which represented the objects used in the 
previous tasks and some of which did not. Subjects tended to report that 
they had seen the former before, even though no isometric pictures at all 
had been shown. Although this research is still in progress, and various 
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B 

FIGURE 21 Mental rotation. A: Given two objects at different orientations, the time 
that it takes to judge whether they are the same or different ; s a function of the angle 
between their orientations, whether in the picture plane i, ii or in depth iii, iv. It is as 
though the subject must rotate one object into the orientation of the other before the 
two can be compared (Shepard and Mctzler, 1971). B: If the two shapes to be compared 
i, ii are presented simultaneously (i.e., separated in time by an interval t = 0.0) their 
reaction time R.T. increases with angle, cj>, between their orientations, as above. But 
if the comparison figure is presented after an interval / = cf>/io, where to is the subject's 
characteristic rotation rate (obtained from the slope of the function at t = 0.0), then 
the R.T. does not increase with increasing angle <|> (Cooper, 1976). This is just what 
one would mean by saying that the subjects had rotated the object before making the 
comparison. 
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FIGURE 22 The structure of perceived objects — orthographic and isometric projec- 
tions, fwo different pictorial systems are shown here: Pictures a and b and pictures d 
and e are orthographic projections of objects whose respective isometric projections are 
c and f. Isometric projections are easier to grasp, at least for these objects. Cooper 
(1984) presents preliminary evidence that even when subjects have been presented only 
with orthographic projections of objects, they tend to report later that they hr.ve seen 
isometric projections of those objects. 

controls are needed, the preliminary results will, I feel certain, survive the 
necessary replication and controls: Orthographic and isometric projections 
can both specify the form of a three-dimensional object, but the isometric 
projections are in some sense closer to the way in which we extract and 
store the information — closer to the mental structure involved in perceiving 
and comparing the objects. Although I know of no research to the point, 
it hardly needs an experiment to discover that isometric pictures are more 
rapidly and accurately comprehended than orthographic ones. What ex- 
periments can do is give us a better understanding of why that is so, and 
of the sense in which the isometric picture is more like the structure that 
underlies our perception of the object. 

These three research procedures that I have described in connection with 
Figures 20 through 22 are interesting more as examples of a field of ex- 
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perimental and quantitative inquiry than as demonstrations that mental pro- 
cesses can be studied and are in that sense real; the latter is not a new 
conception. It has repeatedly come into and gone out of scientific fashion, 
and merely showing that mental structure "exists," in some sense, will 
not add much to its history. Fortunately, this time there are vested interests 
in obtaining and systematizing the knowledge, and technical facilities for 
doing so, that should keep research and theory centered on these problems 
of object perception and representation for some time to come. 
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